1 Lagrangian Methods 



1.1 Lagrangian methods 

Let P(b) denote the optimization problem 

minimize f(x) , subject to h(x) = 6, x G X . 

Let x G X(b) = {x G X : h(x) = 6}. We say that x is feasible if x G X(6). 
Define the Lagrangian as 

L{x,\) = f{x)-\ T {h{x)-b). 

Typically, I Q", /i : T h R™ with Met™ Here A is called a Lagrangian 
multiplier. 

Theorem 1.1 (Lagrangian Sufficiency Theorem) If x is feasible for P(b) 
and there exists A such that 

inf L(x, A) = L(x, A) 

then x is optimal for P(b). 

Proof of the LST. For all x G X(b) and A we have 

f(x) = f(x)-\ T (h(x)-b) = L(x,\). 
Now x G X(b) C X and so by assumption, 

f(x) = L(x, A) < L(x, A) = f(x) , for all x G X(b) . 
Thus x is optimal for the optimization problem P(b). ■ 

Example 1.1 Minimize x\ + x\ subject to a\X\ + 02^2 = > 0. 

Here L = + £2 — A(ai#i + 0.2^2 — b). We consider the problem 

minimize \x\ + x 2 , — \{a\X\ + 02^2 — b)] . 

Xi,X2>0 

This has a stationary point where (^1,^2) = (Aai/2, Aa2/2). Now we choose 
A such that ai^i + 0,2^2 = &■ This happens for A = 2b/(a 2 + a|). We have 
a minimum since d 2 L/dx 2 > 0, d 2 L/dx\dx2 = 0. Thus with this value of A 
the conditions of the LST are satisfied and the optimal value is b 2 / (a 2 + a 2 ) at 
(xi,x 2 ) = (aib, a 2 b)/{a\ + af). 
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1.2 The Dual Problem 

Let us define 



Then for all A 



4>(b) = inf f{x) and g(X) — inf L(x, X) 



0(6) = inf L(x, A) > inf L(x, A) = g{\) . (1.1) 



Thus g(A) is a lower bound on 0(6), i.e., a lower bound on the solution value of 
P(b). As this holds for all A it is interesting to make this lower bound as large 
as possible. Of course we can restrict ourselves to A for which g(X) > — oo. This 
motivates the dual problem, defined as 

maximize g(X) , subject to A 6 Y , 

where Y = {A : g{X) > — oo}. In (1.1) we have a proof of the so-called weak 
duality theorem that 

inf ■ f(x)>m^g(X). (1.2) 
The left hand side of (1.2) poses the primal problem. 



1.3 Strong Lagrangian 

We say that P{b) is Strong Lagrangian if there exists A such that 

0(b) = inf L(x,X). (1.3) 

In other words, P{b) is Strong Lagrangian if it can be solved by the Lagrangian 
method. But when does this happen? Usually we just try the method and see. 
If we are lucky, as in Example 1.1, then we will be able to establish that there 
exists a A that lets us solve P(b) this way. However, there are important classes 
of problems for which we can guarantee that Lagrangian methods always work. 

Note that (1.3) says that there is a A such that 4>{b) = g(X). Combining this 
with (1.2), we see that if the problem is Strong Lagrangian then min of primal = 
max of dual. 



1.4 Hyperplanes 

Let the hyperplane (c, a) be given by 

a = P-X T (b-c). 
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It has intercept at (3 on the vertical axis through 6, and has slope(s) A. Consider 
the following approach to finding 0(6): 

1. For each A, find f3\ = max/? such that the hyperplane lies completely below 
the graph of 0. 

2. Now choose A to maximize f3\. 




b b 



Lagrangian methods work in Case 1 because of the existence of a tangent to 
at b. Define a supporting hyperplane (c, a) at b as 

a = 0(6) - X T {b - c) , where 0(c) > 0(6) - A T (6 - c) for all c G R m . 

In fact, (3\ = g(X) = min x€ x L(x, A). To see this, we argue 



g(X) = inf L(x, A) 



inf inf [fix) - X T (h(x) -6)1 



inf [0(c)-A T (c-6)] 

c€n m 



sup{/3 : /3 - A T (6 - c) < 0(c) , for all c G R m } 
/3a- 



Hence, the dual problem is maxfl\. Again, we see the weak duality result of 
max/?A < 0(6), with equality if the problem is Strong Lagrangian. 

Theorem 1.2 The following are equivalent: 

(a) there exists a (non-vertical) supporting hyperplane to at b; 

(b) the problem is Strong Lagrangian. 



Hyperplanes 
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This is important since a (non- vertical) supporting hyperplane exists if (f)(b) is a 
convex function of b. We can find conditions that make (ft convex. 

Proof. Suppose there exists a (non-vertical) supporting hyperplane to (ft at b. 
This means that there exists A such that 

T 



(ft{b) - X (b — c) < (ft(c) for all c G 



This implies 



<f>(b)< mf [(ft(c) - \ T (c - b)] 



T 



= inf inf \f(x) - X T (h(x) -6)1 
= inf L(x, A) 
= ^(A) 

However, we have the opposite inequality in (1.1). Hence (ft{b) = g(X). This 
means that P(b) is Strong Lagrangian, i.e., can be solved by minimizing L(x,X) 
with respect to x. 

Conversely, if the problem is Strong Lagrangian then there exists A such that 
for all x G X 

(ft{b) < f{x)-X T {h{x)-b). 

Imagine minimizing the right hand side over x G X(c), where h(x) = c. This 
gives 

4>{b) < (ft{c) - X T {c-b). 

This is true for all c, and hence 

cft{b) -X T {b-c)< cft{c) for all c G R m . 
Hence, (ft has a (non-vertical) supporting hyperplane at b. ■ 



2 Linear Programming 



2.1 Convexity and Lagrangian methods 

1. A set S is a convex set if for all < 6 < 1 

x,y G S =>- 5x + (1 — S)y G 5 . 

2. A real- valued / is a convex function if for all x, y G S and < 5 < 1 

<J/(z) + (1 - 5)f(y) > f(8x + (1 - 5)y) . 

3. A point x is an extreme point of S if whenever 

x = 5y + (1 — £)z 
for some y,z E S and < (5 < 1 then x = y = z. 

Theorem 2.1 (Supporting Hyperplane Theorem) Suppose <fi is convex and 
b lies in the interior of the set of points where (f) is finite. Then there exists a 
(non-vertical) supporting hyperplane to (p at b. 

So, we are interested in conditions on the problem that make <fi convex. 

Theorem 2.2 Consider the problem P{b), defined as 

minimize f (x) subject to h(x) < b . 

If X is a convex set and f and h are convex then (p is convex. 

Proof. Take bi, 62 and b = 5b\ + (1 — <5)£»2 for < 5 < 1 with b\, 62 such that (ft is 
defined. Take Xi feasible for P(bi) for i = 1, 2 and consider x = 8x\ + (1 — S)x2- 
Then X convex, xi, X2 G X implies that x G X. Also, h convex gives 

h(x) = h(Sxi + (1 — 5)x2) 

< 5h{x 1 ) + (1 - S)h{x 2 ) 

< Sh + (1 - S)b 2 
= b. 

So x is feasible for P(b). So, if / is convex 

m < f{x) = f(5 Xl + (1 - 5)x 2 ) < 5f( Xl ) + (1 - 8)f(x 2 ) . 
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This holds for all x\ G X(b\) and X2 G Xfo) so taking infimums gives 

cf>(b) < J0(6i) + (1 - £)0(6 2 ) 

so that ^ is convex. ■ 

Remark. Consider the constraint h(x) = 6. This is the same as h(x) < b and 
—h(x) < —b. So (ft is convex under these constraints if X is a convex set and /, 
h and —h are all convex. Thus h should be linear in x. 



2.2 Linear programs 

We will study problems of the form 1 

minimize |c T x : Ax < b , x > 0} 

where x and c are n-vectors, 6 is a m-vector and A is a m x n matrix. Such 
problems are also written out in the longer form 

minimize c T x , subject to Ax < b , x > . 



Example 



minimize — (x\ + £2) 
subject to 

X\ + 2lE2 < 6 
X\ — X2 < 3 

xi, £2 > . 



xi = 



£1 + £2 =const 




x 2 = 



#1 + 2^2 = 6 



2.3 Duality of linear programs 

The primal LP optimization problems 

(LP =) : minimize{c T x : Ax = 6, x > 0} 
(LP >) : minimize{c T x : Ax > 6, x > 0} 



1 For a thorough introduction to the topic of linear programming see Richard Weber's course 
on Optimization, available at: http://www.statslab.cam.ac.uk/ rrwl/opt/ 
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have corresponding dual problems 

Dual of (LP =) : maximize{6 T A : ^4 T A < c} 

Dual of (LP >) : maximize{6 T A : ^ T A < c, A > 0} 

2.4 Derivation of the dual LP problem 

Consider (LP >), and introduce slack variables z to form the problem 

minimize c T x , subject to Ax — z = 6, £ > 0, z > . 

So the set X C M m+n is given by 

X = {(x,z) : x > 0, z > 0} . 

We use a Lagrangian approach. The Lagrangian is 

L ((x, z); A) = c T x - A T (Ax - z - b) = (c T - A T ^) x + X T z + A T 6 

with finite minimum over (x, G X if and only if 

A g Y = {A : A > 0, c T - A T ^ > 0} . 

The minimum of L((x, z)\ A) for A G y is attained where both (c T — A T ^4) x = 
and A T z = 0, so that 

g(X) = inf L(x;A) = A T 6. 
Hence form of dual problem. 

2.5 Shadow prices 

The Lagrange multipliers play the role of prices since we have that 

d(p _ dg(X) = 

The variables A^ are also known as shadow prices. 

2.6 Conditions for optimality 

For the (LP >) problem, x and A are primal and dual optimal respectively if and 
only if x is primal feasible, A is dual feasible and, in addition, for any i = 1, . . . , n 
and j — 1 , . . . , m 

(c T - A 7 A)iXi = = Xj(Ax - b)j . 
These are known as the complementary slackness conditions. 



Basic insight 



8 



2.7 Basic insight 

If an LP has a finite optimum then it has an optimum at an extreme point of the 
feasible set. 

There are a finite number of extreme points so an algorithm for solving the 
LP is 

• Find all the vertices of the feasible set. 

• Pick the best one. 

Our example has an optimum at C. However, there are ^ n vertices, so this 

algorithm could take a long time! 

2.8 Basic solutions 

A basic solution to Ax = b is a solution with at least n — m zero variables. The 
solution is non-degenerate if exactly n — m variables are zero. The choice of the 
m non-zero variables is called the basis. Variables in the basis are called basic; 
the others are called non-basic. 

If a basic solution satisfies x > then it is called a basic feasible solution 
(bfs). The following is a theorem. 

The basic feasible solutions are the extreme points of the 
feasible set. 



In our example, the vertices A F are basic solutions (and non-degenerate) and 
A-D are basic feasible solutions. 



3 The Simplex Algorithm 



1. Start with a bfs. 

2. Test whether this bfs is optimal. 

3. If YES then stop. 

4. If NO then move to an 'adjacent' bfs which is better. Return to step 2. 

3.1 Algebraic viewpoint 

A basis, B, is a choice of m non-zero variables. For any x satisfying the constraints 
Ax = 6, we can write 

AbXb + AjyXN = b 

where Ab isamxm matrix, An isamx(n-m) matrix, xb and b are m- vectors 
and xn is a (n — m)-vector. 

A basic solution has xn = and AbXb = b and a basic feasible solution 
has xn = 0, AbXb —b and xb > 0. 

As we have seen, if there exists a finite optimum then there exists a bfs that 
is optimal. 

Nondegeneracy assumptions 

We will assume that the following assumptions hold. (If they do not, then they 
will do so for a small perturbation of the data). 

1. The matrix A has linearly independent rows, i.e., rank(yl) = m. 

2. Any m x m matrix formed from m columns of A is non-singular. 

3. All basic solutions AbXb = b, xn = have exactly m non-zero variables, i.e., 
Xi for i G B. 

3.2 Simplex tableau 

Now for any x with Ax = 6, we have xb = A^ 1 ^ — A^xn). Hence, 

f(x) = c T x = c^xb + c n xn 

= CBA^(b - A N x N ) + c N x N 

= c^A^b + (c N - CqA^A^xn . 

We can assemble this information in a tableau. 
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basic 


non-basic 




I 


A^An 


a-h 





c N — c^A B An 


-clA-'b 



3.3 Test for optimality 

Suppose we want to maximize c T x and we find 

(c N - CbA^An) < and A^b^O. 

Then for all feasible x (since x > =>- xjy > 0) 

f(x) = clA^b + (c N - c T B A B l A N )x N < c\A~ B x b . 

But for bfs x with xb = A^b and xn = we have fix) = CqA b b. So, x is 
optimal. 

This gives us an easy way to check if a given bfs is optimal. 

3.4 Choice of new bfs 

Alternatively, if some {c N — c^A^ 1 A^)i is positive we can increase the value of 
the objective function by increasing from zero the value of [xn)%- 

We would like to increase (xjv)i by as much as possible. However, we need to 
keep the constraints satisfied. So as we alter {xx)i the other variables alter and 
we must stop increasing (xjsr)i if one becomes zero. 

The net effect is that we interchange one basic variable with one non-basic 
variable. 

3.5 Simplex algorithm 

1. Find an initial bfs with basis B. 

2. Check the sign of (c N — c^Ag 1 A^)i for i 6 N. Are all components non- 
positive? 

3. If YES then we are at an optimum. Stop. 

4. If NO, so that (c N — c^A^A^i > 0, say with i £ TV, increase {xjy)i as 
much as possible. 



11 The Simplex Algorithm 

Either we can do this indefinitely, which means the maximum is 
unbounded. Stop. 

or one of xb variables becomes zero, giving a new new bfs. Repeat from 
step 2. 

3.6 Simplex algorithm: tableau form 

0. Find an initial basic feasible solution. 

The tableau takes the form 







a j 





This is easy when the constraints are of the form 

Ax < b , b > . 

We can write this as 

Ax + z = b, z>0 
and take an initial basic feasible solution of 

x = 0, z = b>0. 

It is best to think of this as extending x to (x, z) and then setting 

(x B ,x N ) = (z,x) = (6,0) . 

1. Choose a variable to enter the basis 

Look for a j such that aoj > 0. Column j is called the pivot column and the 
variable corresponding to column j will enter the basis. If aoj < for all j > 1 
then the current solution is optimal. If there is more than one j such that aoj > 
choose any one. A common rule-of-thumb is to choose the j for which aoj is most 
positive. Alternatively, we could choose the least j > for which a j > 0. 

2. Find the variable to leave the basis 

Choose i to minimize a^jaij from the set {% : > 0}. Row % is called the 
pivot row and aij is called the pivot. If a\j < for all i then the problem is 
unbounded and the objective function can be increased without limit. 



Simplex algorithm: tableau form 
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If there is more than one i minimizing aio/aij the problem has a degenerate 
basic feasible solution. 

In our example we have at this point 





X\ X 2 Z\ z 2 




z\ basic 


12 10 


6 


Z2 basic 


1-10 1 


3 


a j 


110 






3. Pivot on the element 

The purpose of this step is to get the equations into the appropriate form for the 
new basic feasible solution. 

• Multiply row i by 1/ciij. 

• Add —(cikj/ciij) x (vowi) to each row k ^ i, including the objective function 
row. 

The new tableau form: (after re-arranging rows and columns), is as at the end 
of Section 3.6. In our example we reach 





X\ 


x 2 


Z\ 


Z2 




z\ basic 





3 


1 


-1 


3 


x\ basic 


1 


-1 





1 


3 


a 0j 





2 





-1 


-3 



Now return to Step 1. 



In our example, one further iteration brings us to the optimum. 





X\ X 2 Z\ Z2 




x 2 basic 


1 1 - 1 


1 


x\ basic 


1 i I 


4 


a 0j 


o o -1 -i 


-5 



This corresponds to the bfs x\ = 4, X2 = 1, z\ = Z2 = 0, i.e., vertex C. 



4 Advanced Simplex Procedures 



4.1 Two phase simplex method 

Suppose we do not have the obvious basic feasible solution. Consider 



maximize — 6x\ — 3x2 
subject to 



maximize — 6xi — 3x2 
subject to 



Xi + x 2 
2x\ — X2 

3X2 



> l 

> l 

< 2 

> 



2xi - x 2 - z 2 
3x 2 + z 3 

Xl,X2,Zi,Z 2 ,Z 3 



= 1 
= 1 

= 2 
> 



Unfortunately, the basic solution 
Xo = 



Xi 



Zl 



Z2 



Z3 



is not feasible. The trick is to add artificial variables, yi,y 2 to the constraints 
and then minimize y\ + y2 subject to 

xi + X2 - zi + yi = 1 

2xi - X 2 - Z 2 + 2/2 = 1 
3x 2 + Z3 = 2 
xi,X2,zi,z 2 ,Z3,yi,y 2 > 

We can take the 'easy' initial bfs of yi — l,y 2 — 1, z 3 = 2, xi — 0, X2 = 0. 

In Phase I we minimize 2/1 + 2/2, starting with 2/1 = 1,2/2 = 1 and z% = 2. 
(Notice we did not need an artificial variable in the third equation.) Provided 
the original problem is feasible we should be able to obtain a minimum of with 
2/1 = 2/2 = (since 2/1 and 2/2 are not needed to satisfy the constraints if the 
original problem is feasible). At the end of Phase I the simplex algorithm will 
have found a bfs for the original problem. Phase II proceeds with the solution 
of the original problem, starting from this bfs. 

Note: the original objective function doesn't enter into Phase I, but it is 
useful to carry it along as an extra row in the tableau since the algorithm will 
then arrange for it to be in the appropriate form to start Phase II. 

We start with 





Xi 


x 2 


Zl 


Z2 


Z3 


2/i 


2/2 




2/1 


1 


1 


-1 








1 





1 


2/2 


2 


-1 





-1 








1 


1 


Z3 





3 








1 








2 


Phase II 


-6 


-3 




















Phase I 

















-1 


-1 
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Preliminary step. The Phase I objective must be written in terms of the non- 
basic variables. This is accomplished by adding rows 1 and 2 to the bottom row, 
to give 







x 2 




Z2 




Z3 


yi 


V2 






yi 




1 


-1 










1 







1 


V2 


m 


-1 





-1 










1 




1 







3 










1 










2 


Phase II 


-6 


-3 






















Phase I 


3 





-1 


-1 




A 

u 


n 
U 


A 

u 


2 


Begin Phase I. Pivot on 021 to get 














X\ 


X2 


Zl 


^2 




Z3 


yi 


2/2 











3 
2 


— 1 


1 
2 







1 


1 

2 




1 
2 


Xi 


1 


1 

2 





1 
2 










1 




1 
2 


Z3 





3 










1 










2 







-6 





-3 










3 


3 







3 
2 


-1 


1 
2 










_3 
9 


1 
2 


Pivot on <2i4 to get 






















X\ 


x 2 


Zl 


Z2 




Z3 


yi 


2/2 






Zl 





3 


-2 


1 







2 


-1 




1 


X\ 


1 


1 


-1 










1 







1 







3 










1 










2 







3 


-6 










6 





6 





















-1 


-1 






End of Phase I. yi = yi — and we no longer need these variables, and so drop 
the last two columns and Phase I objective row. We have a bfs with which to 
start Phase II, with xi — 1, — 1, Z3 — 2. The rest of the tableau is already 
in appropriate form. So we rewrite the preceeding tableau without the yi , yi 
columns. 

Begin Phase II. 





Xi 


Xi 


Zl 


Zl 


Z3 




Zl 





3 


-2 


1 





1 


Xi 


1 


1 


-1 








1 


Z3 





3 








1 


2 







3 


-6 








6 
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In one more step we reach the optimum, by pivoting on an- 





1* 1 




■y 

-1 


^2 


^3 









1 


2 
3 


1 

3 





1 

3 


X\ 


1 





1 

3 


1 

3 





2 
3 


Z3 








2 


-1 


i 


1 










-4 


-1 





5 



In general, artificial variables are needed when there are constraints like 

< —1, or > 1, or = 1, 

unless they happen to be of a special form for which it is easy to spot a bfs. If 
the Phase I objective cannot be minimized to zero then the original problem is 
infeasible. 

The problem we have solved is the dual of the problem P that we considered in 
Chapters 2-3, augmented by the constraint 3^2 < 2. It is interesting to compare 
the final tableau above with the tableau obtained in solving the primal. They are 
essentially transposes of one another. 



4.2 Primal and dual algorithms 

Consider the problem [LP =), defined as minimize{c T x : Ax = b,x > 0}. This 
has dual maximize{A T 6 : c T — A T ^4 > 0}. At each stage of the primal simplex 
algorithm, we have a tableau, 



basic, xb > 


non-basic, xn = 




/ 


Ab X A n 


A~ B r b > 


c T B - c T B A- l A B = 


c n ~ c b a b 1a n, free 


-clA- l b 



Here we have a basic feasible solution for the primal, xb = A^b, and a basic 
(though not necessarily feasible) solution for the dual, A J = c^A^ 1 . We always 
have primal feasibility and complementary slackness. Recall 

primal 
feasibility 
Ax = b and x > 



dual 
feasibility 
(c T - A T ^4) > 



complementary 

slackness 
(c T - X T A)x = 



> optimality. 



Dual simplex algorithm 
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Primal algorithms maintain primal feasibility and complementary slackness 
and seek dual feasibility. Dual algorithms maintain dual feasibility and com- 
plementary slackness and seek primal feasibility. 

4.3 Dual simplex algorithm 

The dual simplex algorithm starts with and maintains a primal/dual basic 
solution that is dual feasible and satisfies complementary slackness while seeking 
primal feasibility. This can be useful. 

It may be easier to spot a dual feasible solution 



minimize 2x i + 3x2 + 4^3 s.t. 



x\ + 2x2 + x?> > 3 
2x\ — X2 — 3^3 > 4 
xi, x 2 ,x 3 >0 

Note Ci > for all i. Let us add slack variables, Z{ > to obtain 

X\ + 2x2 + x?> — z\ = 3 
2x\ — X2 — 3^3 — Z2 = 4 

The primal algorithm must use two-phases since z\ = —3, Z2 = —4 is not 
primal feasible. However, the tableau contains a dual feasible solution, Ai = A2 = 
0, and c T - A T ^ = (2, 3, 4, 0, 0) > 0. 



-1 


-2 


-1 


1 





-3 




-2 


1 


3 





1 


-4 


2 


3 


4 












Rule: for rows, i, with a^o < pick column j with aij < to minimize aoj/—ciij. 
Pivoting on 021 gives 



and then on a\2 gives 






5 
2 


5 
2 


1 


1 

2 


-1 


1 


1 

2 


3 
2 





1 

2 


2 





4 


7 





1 


-4 


ives 





1 


1 


2 
5 


1 

5 


2 
5 


1 





-2 


1 

5 


2 
5 


11 

5 








3 


8 
5 


1 

5 


28 

5 
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So the optimum is 4p , with x\ = -g-, x<i — g, xs = 0. 

Notice that for problems of the form Ax > 6 we can write 

Ax - z = b z>0 



A 





l-( 


I 





-1 






= b 



Hence 



(c T - X T A) = (2 3 4 0) - (Ai A 2 ) 

• • Ai A 2 ) 



-1 




So the dual variables, A, can be found in the objective function row under the 
slack variables in the optimal tableau. E.g., A = (|, |). 

We may wish to add constraints to optimal solutions 

Suppose we have solved an LP and have the final tableau 

non-basic basic 





I 


+ve 


+ve 








Now we wish to add a new constraint 

a\X\ + a 2 x 2 + 



+ a n x n < b 



If the optimal solution satisfies this constraint the solution remains optimal for 
the new problem. Otherwise, we add it to the tableau to give 














I 





+ve 






1 


b 


+ve 






















I 





+ve 







1 


-ve? 


+ve 











Notice that we still have a dual feasible solution. The problem solution may 
not be primal feasible. However, we can apply the dual simplex algorithm to find 
the new optimum under this additional constraint. 



Dual simplex algorithm 
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Gomory's cutting plane method 

The addition of new constraints is useful in Gomory's cutting plane method 

for integer linear programming. Consider the final tableau on page 16. Pick a 
row in which the far right hand element is not an integer, say row 1. This says 

%2 + x 3 ~~ 5^1 + \ z 2 — § • 

Suppose xi, X2, X3, z\, Z2 are restricted to be non-negative integers. Then we must 
have 

X2 + X 3 - lZi + 0Z 2 < X 2 + X 3 - \z\ + \z 2 = § . 

This is because z±,z 2 > and we have replaced — | and | by the integers that lie 
just below them. 

Since the left hand side is an integer, it can be no more than the integer just 
below |. So a solution in integers must satisfy 

X2 + x 3 - z\ < . 

However, the present solution does not satisfy this, since X2 = |, x 3 = z\ = Z2 = 
0. Thus we can add this new constraint (or cutting plane) to give 



which is 
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3n into standard form 
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Applying the dual simplex algorithm to this, and repeating the process, we 
will eventually arrive at the optimal integer solution. In this example we reach 
the optimum of x\ = 3, X2 = x 3 = in just one more interation. 
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5 Complexity of Algorithms 



5.1 Theory of algorithmic complexity 

An instance of an optimization problem is defined by its input data. E.g., an 
instance of linear programming with n variables and m constraints is described 
by the inputs c, A and b. There are mn + m + n numbers and if all of them can 
be expressed in no more than k bits, the instance can be described in a string of 
(mn + m + n)k bits. This is the instance size. 

An optimization problem is solved by a computational algorithm whose run- 
ning time depends on how it is programmed and the speed of the hardware. A 
slarge instance can be easy to solve, such as LP with A — I. However, in general, 
we expect an algorithm's running time to increase with size of the instance. Ig- 
noring details of the implementation, the running time depends on the number of 
arithmetic operations involved. For example, the linear system Ax = 6, with A 
being n x n, can be solved by the algorithm of Gaussian elimination, using 0(n 3 ) 
operations of addition, subtraction, multiplication and division. We define 

• f(n) = 0(g(n)) if there exists a c such that f(n) < cg(n) for all n. 

• f(n) = Q(g(n)) if there exists a c such that f(n) > cg(n) for all n. 

• f( n ) = 0(g(n)) if f(n) is both 0(g(n)) and ft(g(n)). 

Of course multiplication is more difficult than addition and so in computing 
running time we might count operations according more elementary computer 
instructions. In what follows we make use of Turing's famous proof that the 
class of things that can be computed is the class things that can be computed 
by a Deterministic Turing Machine (DTM). A DTM is essentially a finite-state 
machine that can read and write to an external storage medium. 

When a DTM is given an input x it runs for some number of steps and then 
outputs an asnwer, f(x). This number of steps is its running time. There are many 
Turing machines. Let Tm(ti) be the worst-case running time of some Turning 
machine, say M, over inputs x of size \x\ = n. We say that a function f(x) is 
computable in polynomial time if there exists some Turing machine that can 
compute f(x) within \x\ k steps (for some fixed k). The definition is robust, since 
different Turing machines can simulate one another and more efficient models 
of computation, by at most squaring or cubing the the computation time. In 
contrast, if Tm(ti) = Q(2 cn ) for all M, then f(x) is said to be computable in 
exponential time. 
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5.2 The Travelling Salesman Problem 

Given a finite set of points S in the plane, the TSP asks for the shortest tour of 
S. More formally, given S = {s\, s 2 , • • • , s n } a shortest tour that visits all points 
of S is specified by a permutation o that minimizes the sum of distances 

d(s a (i), S<t(2)) + rf(S(r(2))S(7(3)) + 1 - ^( s <r(n-l) ? »<r(n)) + ^( s a(n) 5 S<r(l) ) 

where c£(si, Sj) is the distance between points z and j. 

In general, it is difficult to prove that a problem does not have a polynomial 
time algorithm. No polynomial time algorithm is known for TSP. But there is 
also no proof that a polynomial time algorithm for TSP does not exist. We see in 
Lecture 6 that the simplex algorithm for LP is an exponential time algorithm. It 
was not until the 1970s that a polynomial time algorithm was discovered for LP. 

5.3 Decision Problems 

A decision problem (or recognition problem) is one that takes the form of a 
question with a yes /no answer. For example, decision-TSP is 

Given the points S, and L is there a tour of length < LI (5-1) 

This differs from optimization-TSP: find the shortest tour, or the evaluation-TSP: 
find the length of the shortest tour. Of course the three types of problem are closely 
related. We focus on decision problems. This avoids problems in which the size 
of the input or output causes non-polynomial time behaviour. 

5.4 V and AfV problems 

A decision problem in is V if its answer is computable in polynomial time. I.e., 
there exists a deterministic Turing machine which, given any instance (with input 
data x), can compute the answer within a number of steps bounded by \x\ k (for 
some fixed k). 

A decision problem belongs to MV if there exists a checking function r(x,y) 
such that the answer to the decision problem is yes iff there exists a y (called a 
certificate) such that r(x 7 y) = 1 and r(x 7 y) is computable in polynomial time. 
For example, if the answer to (5.1) is yes then y could be the order in which the 
points should be visited. It takes only time 0(n) to add up the length of this tour 
and check it is less than L (this being the computation r(x,y)). 

L MV stands for nondeterministic polynomial. An equivalent definition of 
MV is that it is the class of decision problems whose answers can be computed 
in polynomial time on a 'nondeterministic Turing machine' (NDTM). A NDTM 
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consists of many DTMs working in parallel, any one of which can answer yes in 
polynomial time without consulting the others. Essentially, these computers are 
carrying out parallel calculations of r(x, y) for all possible y. Either one of them 
produces the answer yes within time n fc , or the answer is no. A NDTM for (5.1) 
could consist of (n — 1)! DTMs, each of them checking one possible tour to see if 
its length is less than L. Clearly, V C J\fV. It is believed that V C AfV: that is, 
there are problems in AfV which are not in V. However, this is a major unsolved 
problem. 




5.5 Polynomial reduction 

When is problem III no harder than another problem II2? We say that III re- 
duces to II2 if we can construct an algorithm for IIi as follows. 

1. Make a (polynomial time) transformation of the instance of IIi into an instance 

of n 2 . 

2. Apply some algorithm to solve this instance of n 2 . 

3. Make a (polynomial time) transformation of this solution of II2 back into a 
solution of IIi. 

The idea is that we can use an algorithm that solves II 2 to solve IIi, with additional 
work in steps 1 and 3 that requires at most polynomial time. Thus TEi is really 
no harder than II2. 

5.6 AfV- completeness 

Now we can talk about the hardest problems in AfV. A problem II is said to 
be A/T^-hard if every problem in AfV can be reduced to it. It is said to be AfV- 
complete if moreover II G AfV. Thus all AfV- complete problems can be reduced 
to one another and are as difficult as all problems in AfV. 

There are many ./VP-complete problems. LP in which all variable are restricted 
to be or 1 is ./VP-complete. TSP is ./VP-complete. So all problems in AfV are no 



Examples of 7VP- complete problems 
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harder than either of these problems. If you can find a polynomial time algorithm 
for TSP then you have found a polynomial time algorithm for all problems in 
AfV and it would be true that V = MV . As we said, since no one has ever found 
a polynomial time algorithm for any ./VP-complete problem, it is believed that 
V 7^ NV. To show that a new problem, II, is ./VP-complete we must(i) show that 
II 6 AfV, and (ii) show that a known A/'P-complete problem reduces to II. 

5.7 Examples of TVP-complete problems 

Satisfiability (Cook (1971) Given a propositional formulae with AND's, NOT's, 
OR's and Boolean (T or F) variables Xl, X2, • • • , X r , for example, 

(X 1 OR NOT X 2 ) AND (X 3 AND Xj) 

is there an assignment of truth values to the variables that makes the formulae 
true? (e.g. X\ = X 2 = X3 = X4 = T in the above example.) 

Hamiltonian circuit Given a graph G. Is there a set of edges forming a tour 
of all the vertices? To see that an instance of this is no harder than as TSP, think 
of a TSP instance with d(si, Sj) — 1 if there is an edge from i to j and d(si, Sj) = 2 
if there is not. Ask, 'is there a tour of length < n?' 

Subgraph isomorphism Given two graphs G 1 G' . Does G contain a subgraph 
isomorphic to G'l Interestingly, Graph ismorphism (i.e., 'Are graphs G and G' 
isomorphic?') is known to be in J\fV, but it is suspected to be neither in V or 
./VP-complete. 

Clique decision problem Given a graph G and number k. Does G contain a 
clique of size fc? (i.e., k vertices all pairs of which are connected together). E.g., 
below left: k — 3, yes; k — 4, no. 




Vertex cover decision problem Given a graph G and number k. Is there a 
set of k vertices such that every edge of G starts or finishes at one of them? Such 
a set of vertices is called a vertex cover. E.g., above right: k = 2, no; k = 3, yes. 



6 Computational Complexity of LP 

6.1 Running time of the simplex algorithm 
Worst-case running time 

The simplex algorithm moves from one basic feasible solution to an adjacent one, 
each time improving the value of the objective function. However, it can take an 
exponentially large number of steps to terminate. 

Suppose the feasible region is the cube in R d defined by the constraints 

< Xi < 1 , i = 1, . . . , d 

and we seek to maximize xa- There are 2 d vertices. The paths shown below visit 
all vertices before terminating at (0, 0, . . . , 1). 




Given < e < 1/2, consider the perturbed unit cube given by the constraints 

e < x\ < 1 , exi-i < Xi < 1 — exi-i i — 2, . . . , d . 

It can be verified that the cost function increases strictly with each move along 
the path. For example, for d = 2 we have 
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Note that X2 increases along the route ABCD. So if our pivoting rule is always to 
move to the adjacent bfs for which the entering variable has the least index (so- 
called Bland's rule), then the simplex algorithm will require 2 d — 1 pivoting steps 
before terminating. With this pivoting rule the simplex algorithm has exponential 
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worst-case time complexity. Observe that the initial and final vertices are adjacent 
and a different pivoting rule could reach the optimum in only one step. However, 
for all common pivoting rule that have been studied there is some instance for 
which the number of steps until termination is exponential. It is unknown whether 
there is a pivoting rule that might make the simplex algorithm more efficient. 
This is related to the Hirsch conjecture: that for a polytope in W 1 defined by 
m inequality constraints the number of pivots required to move from one bfs to 
another bfs is never more than m — n. 

Average-case running time 

Not all algorithms with exponential time complexity are bad in practice. Often, 
they can have very good performance. The simplex algorithm appears to perform 
well on average. The difficulty is defining what is meant by 'on average'. 

6.2 The complexity of LP 

We shall investigate alternative algorithms for solving LP. In particular, we seek 
a polynomial time algorithm. If we can show that LP G P then this is likely to 
tell us something about good algorithms in practice. There are two classes of 
methods for solving LP. 




Boundary value methods Interior point methods 



The size of an LP instance 

Any non-negative integer, r, (r < U) can be written in binary form 

r = a fc 2 fc + a fc _i2 fe - 1 + --- + ai2 1 + a 2° < 2 lo ^ u 

where ao, a>i, • • • ,ak are or 1. The number k is at most [log 2 U\. Thus, using 
an extra bit for the sign, we can represent any integer r where |r| < U by at most 
(Llog 2 C7J+2) bits. 

An instance of an LP problem is given by a m x n matrix A, a m- vector b and 
a n-vector c. So, assuming that the largest magnitude of any of the components 
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is U, an LP instance has a size in bits of 

(mn + m + n) ( |_log 2 U\ + 2) . 

6.3 Feasibility problem 

Consider the primal/dual pair: 

P : minimize {c T x : > b] 

D : maximize{6 T A : ^4 T A = c, A > 0} . 

By the strong duality of linear programming, each problem has an optimal 
solution if and only there is a feasible solution to 

b T \ = c T x Ax>b, A T X = c 7 A>0. 

Thus we can solve LP if we can solve a feasibility problem like this. We shall 
therefore focus on feasibility and the decision problem 

Is the polyhedron P = {x E R n : Ax > b} non-empty? 

The algorithm that we shall use is known as the ellipsoid method. 

6.4 Preliminaries for ellipsoid method 

Definitions 6.1 

1. Let D be a n x n positive definite symmetric matrix. The set 

E = E(z, D) = {xeR n : (x- z^D'^x - z) < 1} 
is called an ellipsoid with centre at z G R n . 

2. Let D be a nx n non-singular matrix and t G IR n . The mapping S : R n i— ► R n 
defined by S(x) = Dx + 1 is called an affine transformation. 

3. The volume of a set L C W 1 , denoted by Vol(L), is defined by 

Vol(L) = f dx. 

We shall use the result that if S is given by the affine transformation S(x) = Dx+t 
then 

Vol(5(L)) = |det(D)|Vol(L). 



Intuitive description of the ellipsoid method 
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6.5 Intuitive description of the ellipsoid method 

We generate a sequence of ellipsoids {E t }. E t has centre xt, such that 

• If Xt £ P 1 then P is non-empty and the method stops. 

• If xt ^ P, then there is a violated constraint with AiXt < bi, where Ai is some 
row of A and bi is the matching element of b. 

Thus, P is contained in the half-space {x £ M. n : AiX > AiXt}. Call the intersec- 
tion of this half-space with E t a half-ellipsoid. 

We construct the ellipsoid E t +i in such a way that it covers this half-ellipsoid 
and has volume only a fraction of that of E t . 

We repeat this procedure until either we find a point of P or we conclude that 
the volume of P is very small and therefore can be taken as empty. 




The key result is as follows. 

Theorem 6.1 Let E = E(z,D) be an ellipsoid in M. n , and a be a non-zero n- 
vector. Consider the half-space H = {x £ M n : a T x > a T z] and let 

1 Da 

z = z ~\~ 

n+ 1 Va T Da ' 

— n 2 / 2 Daa T D\ 

D = — D = . 

n 2 - 1 V n+1 a T Da J 

Then the matrix D is symmetric and positive definite and thus E' = E(~z, D) is 
an ellipsoid. Moreover, 

(a) EHH C E', 

(b) Vol(E') < e -V(2(n+i)) y i( E ) . 



7 The Ellipsoid Method 



7.1 Ellipsoid algorithm 

Khachiyan's ellipsoid method (1979): 
Input 

(a) A matrix A and vector b denning the polyhedron P = {x G M. n : — 
bi,i = 1, . . . ,ra}. 

(b) A number t> , such that either P is empty or Vol(P) > v. 

(c) An ellipsoid (in fact, a ball) Eq = E(xo,r 2 I) with volume at most V, such 
that P CE . 

Output A feasible point x* G P if P is non-empty, or a statement that P is 
empty. 

We will show subsequently that v — n - n (nU)- n ( n+1 ) > Vo\(E t *) and 
V = (2n) n (nU) n2 > Vol(E ) > Vol(^i) > • • • > Vo\(E t *) . 




Initialize step Let 

t* = r2(n + 1) bg(V/«)l , E = E{x ,r 2 I), D = r 2 I, t = 0. 
Main iteration 

(a) If t — t* stop; P is empty. 

(b) If Xt £ P stop; P is non-empty. 
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(c) If Xt ^ P find a violated constraint, i, such that < &v 

(d) Let ff t = {ief : AT X > ATzt}. 

Construct an ellipsoid -E^+i = -E^t+i, D t+ i) containing E t fl -fft with 

1 A^i 



z t+ i = x t + 



77, 



77- 



t+i 



n 2 — 1 



+ 1 VAJ ' 

' 2 DtAjAjDt 

n + 1 AjD t Ai 



D, 



(e) i := i + 1, return to (a). 

7.2 Proof that E H H C E' 



First, consider the case 2 = 0, D = I and a = e± = (1, 0, . . . , 0) . So, £"i 
R n : £ T £ < 1} and H = {x G R n : a?i > 0}. 



Hence, 



n + 1 ' n 2 — 1 V n + 1 



Re- writing this, we have 



E' Q = I x G 



n + 1 



77 



2 „2 



n+1 



+ 



n z - 1 



n 



E 



^ — 1 

i=2 



n 2 -l " 2 , 2(n + l) 2 



77/ 



77, 



2 X l 



+ 



'n+1 



^ + _L_|<1 



77, 



n + 1 (n + 1) : 
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So that 

f ™„ n 2 -l^A 2 1 2(71 + 1), N 1 

Now suppose x £ Eo D Ho- Then < x\ < 1, and so — 1) < 0. Also, 

SiLi < 1. Hence 

n 2 -lA 2 1 2(n + l) , _ n 2 -l 1 

5 — > x,- H — ^ H ^ aJi(a;i - 1) < ^ 1 — =■ = 1 , 

n A — ' n A n z n z n z 

i=i 

which verifies that x G E' Q , proving that Eq H -f/o C -Eg • 

Now, consider the general case and construct an affine transformation T(-) 
such that T(E) = Eq, T(H) = H and T(E') = E' . The result then follows since 
affine transformations preserve set inclusion, i.e. if A C B C M n and T(-) is an 
affine transformation, then T(A) C T(B). 

Given E = I}), introduce the affine transformation 

T(x) = RD~ 1/2 (x-z) 

where R is a rotation matrix which rotates the unit ball so that D x l 2 a is aligned 
with the unit vector e\ — (1, 0, . . . , 0) T i.e. 

R T R = I and RD Y l 2 a = ||Z) 1//2 ||ei 

So now, T(E') = £"o since 

x e E ^> (x - z) T D~ 1 {x - z) < 1 

(x - z)D- 1/2 R T RD- 1/2 (x -z)<\ 
^=> RD~ 1/2 (x - z) G Eq 
T(x) G E . 

Similarly, T(H) = Ho since 

x G H a T (x - z) > 

a T D 1/2 R T RD- 1/2 (x - z) > 

e^TfV) > 
T(ir) G H 

Similarly, one can show T(E') = E' . Above, we proved that EqC\Ho C E' q , which 
is equivalent to T(E) n T(H) C T(E'), which implies E D H C E' . ■ 



Proof that Vol(£") < e~V M n+1 »Vo\{E) 
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7.3 Proof that Vol(E') < e~ l / ^ n+l ^Vo\{E) 

We have that 

Vol(E') Vol(r(E")) Vol(^) 
Vo\(E) ~ Vo\(T(E)) ~ Vo\(E ) ' 

Now, 

\n+ 1 ' n 2 - 1 V n + 1 1 7/ 
So, introduce the afhne transformation 



n + 1 — 1 \ n + 1 yy 

One can easily check that E'g = F(E ). So 



1/2 

.1' 



Hence, 



Vol(^) = Jdet fj^ (l--± jei eA) Vo\(E ) 



Vol«W n 2 \ n/2 ( 2 \ 1/2 

Voi(^o) V™ 2 - 1 / V n + iy 

n / n \ 



n+1 V ^ 2 — 1 



1 \ / i x^- 1 )/ 2 



= 1 — 1 + 



n + 1 J \ n 2 — 1 

< e -l/(n+l) ^/(n 2 -^ 
_ e -l/(2(n+l)) 

using (twice) the inequality 1 + a < e a (a ^ 0). Therefore, 

V0l (g) e -l/(2(„ + l)) 

Vol(E) 



8 Complexity of the Ellipsoid Algorithm 



8.1 The bound V 

Lemma 8.1 Let A be a rax n integer matrix and let b be a vector in W m . Let U 
be the largest absolute value of the entries of A and b. Then, 

(a) Every extreme point of the polyhedron P = {x 6 M. n : Ax > b} satisfies 

-{nU) n < Xj < (nU) n , j = 1,. 



n 



(b) Every extreme point of the (standardized) polyhedron P = {x G 
satisfies 

-{mU) m < Xj < {mU) m , j = 1, . . . , n . 



Ax = b] 



Proof. Consider first (a). We are assuming here that m > n. Let x be an 
extreme point of P. Choose n linearly independent active constraints and write 
Ax = b where A is n x n invertible submatrix of A and b is the matching n- 
dimensional subvector of b. So, we have x = A _1 b. 
By Cramer's rule, we can write the solution 



Xj 



det(^) 



det(A) 

where A 7 is the same as A except that the jih column is replaced by b. Now 



det(A J ) 



£(-i) H n 



a 



i,a(i) 



i=l 



< n\U n < (nU) n , j = l,...,n, 

. ,n, with \a\ giving the number of 



where a is one of the n\ permutations of 1, . 
inversions (i.e., i < j and a(i) > o~(j)). 

Finally, since A is invertible, det(A) ^ and all entries in A are integer so 
|det(A)| > 1. Therefore, the extreme point x satisfies 

\xj\ < (nU) n , for all j. 

Exactly, the same argument may be used for (b) except that we use a basis matrix 
Ab ■ Ab is m x m and we can replace n by ra throughout. ■ 

By part (a) of the previous lemma all the extreme points of P = {x G M n : 
Ax > b} are contained in the bounded polyhedron Pb defined by 



P B = {xeP : \xj\ < (nU) n , j = l,...,n} 
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P is nonempty if and only if it contains an extreme point. Hence, we can test for 
the emptiness of Pg instead of P. But Pb is contained in the ball -E^O, n(nU) I) 
whose volume is at most 

V = (2n(nU) n ) n = (2n) n (nU) n2 . 

8.2 The bound v 

We say a polyhedron P is full-dimensional if it has positive volume. For ex- 
ample, P = {(^1,^2) : x\ + X2 = l,xi,X2 > 0} has Vol(P) = and so is not 
full-dimensional. 

Lemma 8.2 Let P = {x E M n : Ax > b} and assume that A and b have integer 
entries which are bounded in absolute value by U . Let 

e = ^-^yy [(n + l)U]~ {n+1) , P e = {x E R n : Ax > b - ee] 

where e T = (1, 1, . . . , 1). Then 

(a) If P is empty, then P e is empty. 

(b) If P is non-empty, then P e is full- dimensional. 

Proof of (a) Suppose P is empty and consider the infeasible linear program 
minimize {0 T x : Ax > b] and its dual maximize {X T b : A T ^4 = T , A > 0}. 
Since the primal is infeasible the dual problem has value +oc. Therefore, there 
exists a A > with 

A T ^ = T A T 6=1. 

So, using the previous lemma, we can find a bfs A to the constraints A T ^4 = T , 
A T 6 = 1, A > such that 

A 4 < {{n + l)U) n+1 , foralH. 
Since A is a bfs, at most n + 1 of its components are non-zero so that 

m 

]TA, < (n + 1) ((n + l)U) n+1 . 

i=l 

Therefore, 

A T (6-ee) = l- e^Ai>->0. 

i=l 

Hence, when we replace b by b — ee the value of the dual remains +00 and the 
primal problem is again infeasible and P e is also empty. 
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Proof of (b) Let x be an element of P so that Ax > b. Let y be a vector such 
that 

e e . 

£j 77 < 2/j < »j + "T7 , for all j . 

nU nil 

Then the ith row of Ay satisfies 



a ij I 



n n n 

j=l j=l j=l 

>h- 4==nU 
nil 

= bi- e. 

Therefore, any such vector y belongs to P e and the set of all such vectors y (a 
cube) has positive volume (of (2e/nU) n ) and so is full-dimensional. ■ 

The following lemma can also be proved. 

Lemma 8.3 Let P = {xG R n : Ax > b} be a full- dimensional bounded polyhe- 
dron, where the entries of A and b are integer and have absolute value bounded 
by U. Then, 

Vol(P) >n" n (nt/)- n2(n+1) . 



8.3 Running time of the ellipsoid method 

We have the values 

V = (2n) n (nU) n2 and v = n- n {nU)- n ^ n+l) 

and know that the ellipsoid method takes at most t* = [2(n + 1) log(V/i;)] steps. 
This gives t* = 0(n 4 log(nC/)). 

In practice, we apply the ellipsoid algorithm to P e . We know that P is 
nonempty if and only if P e is nonempty. Let P e = {x : (l/e)A > (l/e)b— e}. Recall 
U is an integer, so 1/e is also an integer and so this writes the constraints of P e so 
that all coefficients are integers that are bounded in magnitude by U e — U/e. We 

2 

know that P e is contained in a ball of volume at most V e = (2n) n (nU e ) n . Also, 
when P is nonempty P e has volume of at least v e = (2e/nU) n (by the proof in 
part (b) of Lemma 8.2). Thus the ellipsoid algorithm applied to P € will terminate 
in a number of steps at most \2(n + l)\og(Ve/v e )~\ = 0(n 4 log (nU)). 

There are a few wrinkles to consider: on a computer we cannot actually cal- 
culate the square root \J Aj D t Ai that is needed to find x t +i from x t . There is 
also a worry that in multiplying numbers together we might be forced to use ones 
as large as 2 U . However, it can be shown that if we carry out the algorithm to 
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a certain accuracy it will still correctly decide whether P is nonempty. At each 
step the calculation of Aj D t Ai requires (n 2 ) operations, which makes the running 
time of the algorithm 0(n 6 log(nC/)). 

8.4 Sliding objective ellipsoid method 

Suppose we wish to solve the problem minimize {c T x : Ax >b,x> 0}. First, use 
the ellipsoid method to find a feasible solution xq £ P, where P = {x £ M n : 
Ax > b}. Now we apply the ellipsoid method again (note the strict inequality) to 
the new polyhedron given by 

PH{x eR n : c T x < c T x }. 

If this is empty then xq is optimal. Otherwise, we have a new solution x\ G P, say, 
with strictly smaller objective function than c T xq. Now we reapply the ellipsoid 
method to the new polyhedron. 




At each iteration we add a new constraint in the direction of the vector c. All the 
constraints c T x < c T Xt are parallel to one another. One can show that by this 
procedure we reach the optimum in polynomial running time. 



9 The Network Simplex Algorithm 

9.1 Graph terminology 

The next four lectures are about network flow problems. They include trans- 
portation, assignment, maximum flow and shortest path problems. 

A graph G = [N 1 A) consists of a set of nodes, N, and a set of arcs, A. In an 
undirected graph the arcs are unordered pairs of nodes G A, i,j G N. In a 
directed graph (also called a network) the arcs are ordered pairs of nodes 
A walk is an ordered list of nodes ii, 12, ■ ■ ■ , it such that, in an undirected graph, 
{ik, ik+i} G A, or, in a directed graph, that either (ik,ik+i) G A or (ik+i,ik) G A, 
for k — 1, . . . , t — 1. A walk is a path if ii, i2, . . . , ik are distinct, and a cycle if 
ii, Z2, . . • , ifc-i are distinct and %\ = %k- A graph is connected if there is a path 
connecting every pair of nodes. 




a directed graph a spanning tree (dotted) 



A network is acyclic if it contains no cycles. A network is a tree if it is 
connected and acyclic. A network (N f , A') is a subnetwork of (TV, A) if N' C N 
and A' C A. A subnetwork (AT', A') is a spanning tree if it is a tree and N f = N. 



9.2 The minimum cost flow problem 

Let fij denote the amount of flow of some material on arc G A. Let bi, 

i E N, denote the amount of flow that enters the network at node i G N. If bi > 
we say the node is a source (supplying b{ units of flow). If bi < we say that 
the node is a sink (with a demand of \bi\ units of flow). 

Suppose there is a cost of Cij per unit flow on arc G A. The minimum 
cost flow problem is 

minimize fij 

(i,j)eA 
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subject to 

bi + ^2 hi = ^2 , fo r all « G N 

j:(j,i)EA j:(i,j)EA 

mj < fij < Mij , for all G A . 

These say that flows must be feasible and conserve flow at each node. For feasible 
flows to exist we must also have ^ i£N bi = 0. An important special case is that 
of uncapacitated flows, rriij = and = oc. 

Note that the minimum cost flow problem is a special form of linear pro- 
gram. Its simple structure allows for special algorithms. Constraints are of the 
form Ax = 6, where 

( +1 node i is start of kth arc ; 
(A)ik = < —1 node i is end of kth arc; 
otherwise . 



9.3 Spanning tree solutions 

Assume that the network is connected. A spanning tree solution, fy, is one 

that can be constructed as follows 

1. Pick a set T C A of n — 1 arcs forming a spanning tree and partition the 
remaining arcs A \ T into the two sets L and U. 

2. Set fij = rriij for each arc G L and fij = for each arc G U. 

3. Use the flow conservation constraints to determine the flows fij for arcs (i, j) G 
T. We begin by determining the flows on arcs incident to leaves of the tree T. 
Subsequently we determine the flows on other arcs of T. 

A spanning tree solution with rriij ^ fij < is a feasible spanning tree 
solution. 




Theorem 9.1 A flow vector is a spanning tree solution if and only if it is a 
basic solution. 
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9.4 Optimality conditions 

Consider the Lagrangian of the minimum cost flow problem 

(i,j)EA *GiV \j:(i,j)eA j:(j,i)EA 

Minimizing A) over m^- < fij < Mij gives dual feasibility and comple- 
mentary slackness conditions: 

Cij Cij Aj ~\- Xj ^> '* fij TH'ij 

Cij = cij - A 4 + Xj < fij = 

c^ = c^ - Xi + Xj = -<= my < /y- < Mij 

Observe that if T is a spanning tree then we can solve the following equations 
in a unique way, where n = \N\. 

A n = , Ai - Aj = c^ , for all G T 

9.5 Pivoting to change the basis 

We compute the reduced costs Qj = — (Ai — Aj) for each arc G" T. 

Recall Cij = for all arcs 6 T by construction. 

If c^ > for all G -L and < for all G £7 then the current 

basic feasible solution is optimal. Otherwise, choose an arc where there is a 
violation. This arc together with the tree T forms a cycle. Add (or subtract) as 
much flow as possible around this cycle so as to increase (or decrease) fij. Note 
that ^2 k gCk£ = Ylki Ck £ = Cij-, where the sums are taken around the arcs of the 
cycle. Thus if Cij is negative we can decrease the total cost by increasing the flow 
fij . Similarly, if is positive we can decrease cost by decreasing the fij . 

Example Consider the minimum cost flow problem below. On each arc we give 
the values of (cy, my, M^). There is b\ = 6, 62 = —4, and 63 = —2. The spanning 
tree consists of 2 arcs (shown undashed). In the left hand figure, we set Ai = 
and find A2 = —3 (so c\2 = 3 = Ai — A2). Similarly, A3 = —5. On the arc (1,3) 
the value of C13 — Ai + A3 = 1 — (0) + (—5) = —4. Since this is < we can decrease 
cost by increasing /13. Inserting the arc (1,3) into the tree produces the cycle 
(1, 3, 2, 1). We increase the flow /13 as much as possible shifting flow around this 
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cycle (i.e., by 1). This produces the flows shown in the right diagram. The tree is 
now arcs (1,3), (1,2). We recalculate: Ai = 0, Ai = —3 and A2 = — 1. The value 
of C23 — A2 + A3 = 2 — (—3) + (—1) = 4. Since this is > we want flow on (2, 3) 
be minimal, which it is. So we now have the optimal solution. 




9.6 Finding the initial feasible tree solution 

1. Every network flow problem can be reduced to one with exactly one source 
node and one sink node (by adding in two nodes). 

2. Every network flow problem can be reduced to one without sources or sinks 
(by connecting the above two nodes with an edge). The constraints are just 
Af = 0. Any / satisfying this is called a circulation and such flow problems 
are called circulation problems. 

3. In the case that rriij = 0, for all the zero flow is a feasible tree solution. 
If rriij 7^ for some arc we can replace the flows by — rriij and adjust 
the supplies bi accordingly. 

9.7 Integrality of optimal solutions 

Suppose the input data (rriij, Mij and bi) are all integers. Then the above algo- 
rithm leads to optimal integer solutions. There are no multiplications or divisions. 

Theorem 9.2 (Integrality theorem) For every network flow problem with in- 
teger data, every basic feasible solution and, in particular, every basic optimal 
solution assigns integer flow to every arc. 

This theorem is important for the many practical problems in which an integer 
solution is required for a meaningful interpretation (for example, the assignment 
problems). Later, we investigate linear programming problems subject to the 
additional constraint that the solution be in integers. Such problems are usually 
much harder to solve than the problem without the integer constraint. However, 
for network flow problems we get integer solutions for free. 



10 Transportation and Assignment 
Problems 



10.1 Transportation problem 

In the transportation problem there are m suppliers of a good and n customers. 
Suppose supplier i produces Si units of the good, customer j demands dj units of 
the good, and there is a balance between demand and supply so that 

m n 
i=l j=l 

Suppose the cost of transporting a unit of good from supplier i to consumer j 
is Cij. The problem is to match suppliers with consumers to minimize the total 
transportation cost. We can easily formulate the transportation problem as a 
minimum cost flow problem as follows 



minimize 



EE 

i=l j=l 



Cij fij 



subject to 



d 



j ' 



n 



fa>o, 



3 = 1 

for all 



1, 



, m 



0,M 



oo 



This is a special case of the minimum cost flow problem with rriij 
and the graph structure of a bipartite graph. That is, the nodes divide into 
disjoint sets S (suppliers) and C (customers) and and A C S x C (the only arcs 
are those which connect suppliers to consumers). 



Suppliers 


Customers 


si (T)^^ 






^^(T) di 










S3 





Lemma 10.1 Every minimum cost flow problem is equivalent to a transportation 
problem. 
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Proof. Consider the minimum cost flow problem with rriij = 0, Mij < oo, and 
input data G = (TV, A), Mij, Cij and hi. For every arc G ^4 construct a 

source node with supply M^ . For every node i E N construct a sink node with 
demand ^ fc . ^ k)eA ~~ ^i- Now connect every source node to each of the 
sink nodes i and j with infinite upper bounds on capacities. Let Cijj = and 

c ij,j = 



demand 




There is a 1-1 correspondence between feasible flows in the two problems and 
these flows have the same costs. To see this put a flow of on the arc from i, j 
to j, and a flow of — on the arc from i,j to i. The total amount flowing 
into node i is then — fij) + Ylj fjii which must equal ^ ■ Mjj — hi. Thus 

we have the flow conservation constraints of the minimum cost flow problem. ■ 

For this reason new algorithms are often first tested on transportation prob- 
lems. The case in which there is an arc from every supplier to every consumer is 
known as the Hitchcock transportation problem. 



10.2 Tableau form 

It is convenient to present the input data and spanning tree solutions (i.e., the 
bfs's) for the transportation problem in tableau form. (This is a different form of 
tableau to that of the simplex tableau). We express the input data in a tableau 
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The Xi are computed using the fact that we require Xi — Xj = Cij wherever > 0. 
At the first interation we increase (by as much as possible) the flow in an empty 
cell where Xi — Xj > C{j (i.e., Cjj > 0). We do this by adding and subtracting 9 
around some cycle of cells in which > 0. 
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The final tableau above contains the optimal solution because we have Xi—Xj = 
everywhere that > and — Xj < everywhere else. 

10.3 Assignment problem 

Given a set P of m people and a set T of m tasks and a cost, c^, the assignment 
problem is one of choosing variables to 

m m 

minimize ^ ^ ajfij , 
i=i j=l 



J 1 if person i is assigned to task j 

1 otherwise. 

1 , for all i — 1 , . . . , m 

1 , for all j = 1 , . . . , m . 



subject to 

fij = 

m 

£/« = 

m 



i=l 



These constraints say that each person is assigned to exactly one task and that 
every task is covered. Except for the integer constraints, the assignment problem 
is a special case of the Hitchcock transportation problem 
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10.4 Integer constraints 

The problem in which the integer constraints are replaced with < < 1 is 
known as the LP-relaxation of the assignment problem. If we use the spanning 
tree method then our solution will take values or 1 and hence be optimal for 
both the LP-relaxation and the assignment problem. 

Had we used a non-simplex type method to solve the underlying linear program 
(e.g., some interior point projective algorithm) then an integer- valued optimal 
solution may not be guaranteed. It is a feature of the method and not the problem. 
Many LP-relaxations of problems have multiple non- integer solutions. 



10.5 Maximum flow problem 



Suppose we have a network with a single source node, 1 and a single sink node n 
and upper bounds M^- on all the arcs. Also, assume for convenience that rriij = 0. 
The maximum flow problem is then to send as much flow from 1 to n. We write 
this as 

maximize 5 



subject to 



E 



f, 



'J 



< fij < Cij , 




5 


-5 



i = 1 
i 7^ 1, n 
i = n 



We can formulate the maximum flow problem as a minimum cost flow problem 
by adding an additional arc (n, 1) to the network with m n \ = and M n \ = oo 
and then assign cost c n \ = — 1 to the arc (n, 1) and zero cost to all the original 
arcs. 

Since, the only arc with non-zero cost has negative cost it follows that the 
optimal solution to this minimum cost flow problem will circulate as much flow 
as possible across the network, constrained only by the original arc capacities — 
i.e., it also solves the maximum flow problem. 









3 






Cnl = "I 





11 Maximum Flow and Shortest Path 
Problems 

11.1 Max- flow min-cut theorem 

We return to the max-flow problem of Section 10.5. For S C N define the capacity 
of the cut [S, N\S] as 

C(S,N\S)= dj- 

i£S,j£S 

Theorem 11.1 (Max-flow min-cut theorem) 

Max-flow , S = min cut capacity = min C(S,N \ S) 

S:l£S, n^S 

There are two parts to the proof. First 

value of any flow < capacity of any cut 

Define 

f(X,Y)= h 

i£X,j£Y:(i,j)£A 

and suppose that 1 G S, n ^ S. Then 

■ v( v /,, v /,, ) 

^G5 \j:(i,j)EA j:(j,i)eA ] 

= f(S,N)-f(N,S) 

= f(S, S) + f(S, N\S)-f(N\S,S)- f(S, S) 
= f(S,N\S)-f(N\S,S) 
<f(S,N\S) 
< C(S, N\S). 

We now complete the proof using the Ford-Fulkerson algorithm. Suppose 
that fij is optimal and recursively define S C N &s follows 

1. 1 G S 

2. If i G S and fij < Cij then j G S 

3. If i G S and fji > then j G S. 
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So, S is the set of nodes to which you can increase flow. Either n G S in which 
case we can increase flow along a path from 1 to n, or n S so that [S, N\S] is 
a cut with 1 G S and n S. But for i £ S, j S, fij = Cij, fji = and 

6 = f(S,N\S)-f(N\S,S) = C(S, N\S). 

We can take zero flow fij = as the initial flow. If all capacities and initial 
flows are integer then every step increases the flow by at least one unit. Thus the 
algorithm will converge in a finite number of steps. 

Dual formulation 

We can recast the max-flow problem as a minimum cost flow problem: 

minimize —f n i 
subject to fij — fji = , for all « G N 

j:(i,j)eA j:(j,i)EA 

< fij < dj , for all G A , f nl > . 

Consider the Lagrangian in the usual way with dual variables, A^, i G N. For 
optimality on arc (n, 1) we have (c n i = —1) 

Cnl = C n i — A n + Ai = , 

so that Ai = 1 + A n . On all the other arcs the costs are zero so that the reduced 
costs are just Qj = Xj — Xi and at an optimal solution 

Xj - Xi > =^ fij = 
Xj — Xi < =>• fij = Cij . 

So Xi = 1 for i e S and Xj = 0, j G N \ S. 

11.2 Project management 

A project that is described by a set of jobs that must be completed in a certain 
order. Job i has a duration r^. How can we determine the least time in which the 
project can be completed? 

Consider a graph in which there is an arc (i, j) whenever job i must be com- 
pleted before job j. Introduce two additional jobs, s and s', each of zero duration, 
to indicate the start and finish of the project, and introduce arcs (s, i) and (i, s') 
for every job i. Suppose we start job i at time ti. We wish to 

minimize (t s > — t s ) , subject to tj — ti > Ti , for all (i, j) G A . 
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The dual of this problem is 



n ^ 



maximize %J%3 



subject to 

^ fji ~ ^2 A? = ~ b i > for a11 i > and fij ^ ' for a11 (hJ) e A ' 

j:(j,i)EA j:(i,j)eA 

where b s = 1, b s > = —1 and 6^ = for i ^ s,s f . This is a minimum cost flow 
problem with each arc cost being —7$. The path of arcs for which = 1 defines 
the critical path. 



11.3 The shortest path problem 

Shortest path problems have applications in transportation and communications, 
and are often subproblems embedded in more complex problems. Although they 
are special forms of minimum cost flow problems they can be solved more effi- 
ciently by specialized algorithms. Given a network (TV, A) we think of each arc 
having a length Cij > and consider paths in which arcs are traversed in the 
forward direction only. The length of a path is the sum of the lengths of the as- 
sociated arcs. A shortest path between a given pair of nodes is the path between 
them of minimum length. It is convenient to consider the problem of finding the 
shortest paths from all nodes to a given destination node. 

Take some node, say n = \N\, as a root node. Put a demand of n — 1 at 
this node (that is, b n — — (n — 1)) and a supply of one unit at every other node 
(bi — • • • — b n -i = 1), so that total supply and demand are equal. Let the cost 

of arc be given by its length and solve this minimum cost flow problem 

by the network simplex algorithm. Then the shortest path from any node i to n 
is given by following the arcs of the spanning tree from i to n. 
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Let Vi be the shortest distance from node i to the root node n. These quantities 
are known as labels. Algorithms which systematically determine their values in 
some order are called label-setting algorithms. Algorithms which find their 
values through a sequence of iterations are called label-correcting algorithms. 

11.4 Bellman's equations 

Consider the minimum cost flow problem formulation of the shortest path prob- 
lem. Suppose that the A^s are the optimal dual variables associated with the 
optimal spanning tree solution. Recall that on each arc of the tree, where > 0, 
we must have 

Xi — Cij ~\~ Xj . 

Taking X n = v n = and adding these equalities along a path from i to n we 
conclude that A^ — Vi, the length of the shortest path from i to n. 

Moreover, as b\ = • • • — b n -i = 1, the dual problem, with X n = 0, is 

n — 1 

maximize Xi , subject to Xi < + for all (i, j) G A. 

i=l 

It follows that in the optimal solution, A, if all components are fixed except for 
A^, then we should set Xi as large as possible subject to the feasibility constraint. 
That is, A^ satisfies 

Xi = min {c lk + A fc } , i = 1, . . . , n - 1 . 

k: (i,k)£A 

with X n = 0. These are known as Bellman's equations. 




The idea is that if we are looking for the shortest path from j to n then we 
should choose the first arc of the path (i, k) by minimizing over path lengths 
c-ik + Afc. This method is also known as dynamic programming. 



12 Algorithms for Shortest Path 
Problems 

12.1 Bellman- Ford algorithm 

Let Vi(i) be the length of the shortest path from i to n which uses at most t arcs. 
We have v n (t) = for all t and Vi(0) = oo for all i ^ n. Then 

Vi(t + 1)= min {cik + Vk(t)} , z = l,...,n-l 

k: (i,k)£A 

defines the Bellman-Ford algorithm for solving the shortest path problem. 

It is a label-correcting algorithm. If we assume that there are no negative 
length cycles then Vi(n — 1) = Vi and allowing further additional arcs cannot 
reduce the length, so that Vi(n) — Vi(n — 1). 

The Bellman- Ford algorithm has running time O(mn), where n is the number 
of nodes and m is the number of arcs, since there are at most n iterations and at 
each iteration each arc is examined once. 

To find the shortest paths and not just their length vi we could record a 
successor node, s(i) to i as he first node along the shortest path from i to n. 
Whenever we have Vi(t + 1) < Vi(t), we delete the old successor of i, if any, and 
let s(i) be such that Vi(t + 1) = c is ^ + v s ^(t). 

12.2 Dijkstra's algorithm 

Dijkstra's algorithm is a label-setting algorithm. It can only be applied when all 
arc lengths Cij are non-negative. The idea is to collect up nodes in the order of 
their increasing shortest path lengths, starting from the node with shortest path 
length. To ease exposition, suppose all arcs are present, taking = oo for some 
node pairs if necessary. 

Lemma 12.1 Suppose that > for all Let I 7^ n be such that 

c £n = min c in . 

Then vg = cg n and vg < Vk for all k 7^ n. 

Proof A path from node k to n has a last arc, say (i,n) whose length 

least C£ n . For node £, we also have vg < Q n . Thus vg = cg n < Vk for all k 7^ n. ■ 
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Dijkstra's algorithm is 

1. Find a node t 7^ n such that C£ n < ci n for all i 7^ n. Set V£ = C£ n . 

2. For every node i 7^ £,n set Q n = min{Q n , cm + Q n }. 

3. Remove node I from the graph and return to step 1 to apply the same steps 
to the new graph 

Remarks 

1. The running time is 0(n 2 ) where n is the number of nodes. This follows since 
there are n iterations each involving a comparison and update of arc lengths 
from each remaining node. 

2. In the case of dense graphs, with arcs numbering m = 0(n 2 ), this improves 
on the Bellman- Ford algorithm (which has computational complexity 0(mn). 
Dijkstra's algorithm is the best possible since any shortest path algorithm 
would need £l(n 2 ) operations just to examine every arc at least once. 

Example (n = 4) 



1. Iteration 1 gives £ = 3 and V3 = 1. 

2. Modify arc lengths 

C14 = min{oo, 9 + 1} = 10 and 
C24 = min{7, 8 + 1} = 7. 

3. Eliminate node t = 3 from the graph. 

4. Iteration 2 gives 1 = 2 and vi = 7. 

5. Modify arc length 

cu = min{10,2 + 7} = 9. 

6. Eliminate node 1 = 2. 

7. Node 1 is only node remaining so set 
V\ = C14 = 9. 
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12.3 Reformulation with non-negative Cij 

If Vi (i 7^ n) is the shortest path length from node i to node n then from the 
Bellman equations (dual feasibility) we have that 

Vi < + Vj , for all . 

50 that Cij = c^ + Vj — Vi > are non-negative arc lengths and along any path 
visiting nodes ii, . . . , i p 

p—i p—i p—i 

^ ] Cj k i k+1 — ^ ^ { c ikik+i + ^ifc+i ~~ ^ifc) = ^ip ~~ V h + ^ ] c j k j k+1 . 
k=l k=l k=l 

Hence, the shortest paths under the new arc lengths are the same as those under 
the original (possibly negative) arc lengths. 

This is useful when we wish to solve the all-pairs problem, that is, to find the 
shortest distances between all pairs of nodes. Here, if we have negative arc lengths, 
we would use the Bellman-Ford algorithm to obtain Vi for a given root node and 
then apply Dijkstra's algorithm to solve the n — 1 remaining problems using the 
non-negative costs, c^ which are defined in terms of the V{ just calculated. 

For dense graphs, with m = 0(n 2 ), the overall complexity is 

0(n 3 ) + (n - l)0(n 2 ) = 0(n 3 ) . 

This compares with a computational complexity of 0(n 4 ) for the Bellman- Ford 
algorithm to solve the all-pairs problem. 



12.4 Minimal spanning tree problem 

Given a network (A 7 ", A), with cost c^ associated with arc G A, find the 

spanning tree of least cost. This problem arises, for example, when we wish to 
design a communications network connecting a set of cities at minimal cost. 

Theorem 12.1 (MST property) Let U be some proper subset of the set of 
nodes, N. If (u, v) is an arc of least cost such that u E U and v G N \U then 
there is a minimal cost spanning tree that includes (u, v) as an arc. 

Proof Suppose to the contrary that there is no minimal spanning tree that 
includes (u, v). Let T be any minimal spanning tree. Adding (u, v) to T must 
introduce a cycle, since T is a spanning tree. Thus, there must be another arc 
(u', v') in T such that u' G U and v' G N \ U. If not, there would be no way 
for the cycle to get from u to v without following the arc (n, v ) a second time. 
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Deleting the arc (u',v') breaks the cycle 
cost is certainly no greater than the cost 
Thus, T' contradicts our assumption that 
includes (u, v). 



and yields a spanning tree T' whose 
of T since by assumption c uv < c u > v >. 
there is no minimal spanning tree that 



12.5 Prim's greedy algorithm for MST 

Labels the nodes, N = {1,2,. . . , n} and set U = {1}. Now construct U recursively 
using the property above. 

1. Find the cheapest arc (u, v) connecting U and N\U (breaking ties at random). 

2. Add v to U and repeat until U = N . 

Prim's algorithm takes 0(n 2 ) steps. Suppose each time we start step 1 we already 
know the shortest distance between U and every j £ U, say cjjj = minify C{j. 
Then it takes no more than n comparisons to find the lowest cost arc between U 
and N\U (by comparing all the cjjj for j £ N \ U). Having found a node v to 
add to U, we can now find the shortest distance between U' = U + {v} and any 
j in N \ U', say Cjj'j = minjcv/, cjjj}- Thus each step of the algorithm requires 
at most n comparisons, and the algorithm has n — 1 steps. 



Example. In this example, Prim's algorithm adds arcs in the sequence {1,3}, 
{3,6}, {6,4}, {3,2}, {2,5}. 



13 Branch and Bound 



13.1 Integer programming problems 

Can one obtain an optimal solution to an integer linear program (pathILP) 
by rounding the solution to its LP relaxation? No, not easily. 

Rounding may not be optimal 

Consider the problem 

maximize z = x\ + 5^2 , 

subject to 

x\ + I0x 2 < 20 , x\ < 2 , xt, x 2 £ {0, 1, . . .}. 
Without the integer constraints the optimum is: 

xi = 2, X2 — |; z = 11 . 

If we round x 2 (in the feasible direction) then x 2 — 1, z — 7. 
However, the optimal solution is 

x\ — 0, X2 = 2; z = 10 . 
The best integer solution is not the closest to the best non-integer solution. 

Rounding may not be feasible 

Suppose a LP has optimal solution x\ — 64, x 2 = 10, with feasible set 

— X\ + X2 < 3^ , £i + 2^2 < 16^ 
Neither xi = 6 nor a^i = 7 is feasible. 

A possible algorithm 

If all variables must be integers and the feasible set is bounded, then there are 
only a finite number of feasible solutions. So a possible algorithm is 

1. Try all solutions. 2. Compare them. 3. Pick the best. 

However, there may be very many solutions to compare. We would like a more 
efficient method of choosing between all possible solutions. 
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13.2 Branch and Bound technique 

A Branch and Bound technique can be used for many types of problem. Its 
speed is very much problem dependent. The idea is to divide up the space of all 
solutions in some sensible way so that there is a good chance we can reject large 
subsets of nonoptimal solutions without actually evaluating them. Suppose we 
wish to solve the problem: 

minimize f(x ), subject to x G X , 

where X is a finite set of feasible solutions. The method takes a 'divide- an d- 
conquer' approach, in which problems are broken into subproblems. The original 
problem is broken into one or more subproblems, the ith of which is to minimize 
f(x) over x E Xi. Subsequently, we break Xi into subproblems, continuing in 
this way until a subproblem is easy to solve. 

We also suppose that for any subproblem, in which / in minimized over a 
x E X' , where X' is a subset of X, we can calculate a lower bound such that 

«<min,/(,). 
Branch and Bound algorithm 

The algorithm keeps a list L of outstanding (active) subproblems and the cost U 
of the best feasible solution found so far. 

0. Initialize step. Set U = oo. Discard any obviously infeasible solutions. 
Treat the remaining solutions as one subset. Go to Step 2. 

1. Branch step. Use some branch rule to select one of the remaining subsets 
and break it into two or more subsets. Two common rules are: 

Best bound rule. We partition the subset with the lowest bound, hoping that 
this gives the best chance of an optimal solution and of being able to discard 
other, larger, subsets by the fathom test. 

Newest bound rule. We partition the most recently created subset, breaking 
ties with best bound rule. This has book-keeping advantages in that we don't 
need to keep jumping around the tree too often. It can save some computational 
effort in calculating bounds. 

2. Bound step. For each new subset, Y, calculate i(Y). 
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3. Fathom step. Exclude from further consideration any new subsets, Y, such 
that 

(a) £(Y) > U. 

(b) Y contains no feasible solutions. 

(c) We can find an optimal solution to the problem of minimizing / over Y 1 say 
x', so £(Y) = f(x'). If £{Y) > U, we eliminate Y by test (a). If £{Y) < U, 
we reset U = £(Y), store x' as the best solution so far and re-apply test (a) 
to all other active subsets. 

4. Stopping rule. If there are no remaining active subsets, stop. The best 
solution obtained so far is optimal. Otherwise, go to Step 1. 



13.3 A knapsack problem 

A hiker wishes to take some items on a journey. Which items he should take so 
that the total value is at least 9, but the total weight is a minimum? 



i 


1 


2 


3 


4 
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w % 
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6 
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1 


1.2 


0.75 


0.5 



Each of the 16 subsets of {1,2,3,4} is a possible solution. However, only 8 of 
these are feasible. The hiker's problem is 

4 4 

minimize XiWi , subject to XiVi > 9, and X{ G {0, 1} , for all i . 

i=l i=l 

1. Starting with X as the only subset, we take items in index order until the total 
value is at least 9. This gives U — 11. Since we must include at least one item 
and the least item has weight 1, we have £{X) = 1. 

2. Break X into two subproblems, X\ and Xo, such that the hiker does or does 
not include item 1 in his backpack. Clearly £(Xq) = 1 (since he must include at 
least one of the remaining items) and £{X\) = 5 (since item 1 is in the backpack. 
Neither subproblem can be eliminated by tests (a) or (b). So L = {Xq,Xi}. 

3. Break X into the subproblems Xqi, X 00 , such that the backpack does not 
include item 1, and does or does not include item 2. X 00 is infeasible and so 
we eliminate it. For Xqi we have £(Xqi) = 6. Now L = {Xqi,Xi}. 



A knapsack problem 
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4. Break X\ into two subproblems, Jn and Aio, which contain item 1, and do 
or do not contain item 2. We have $.{Xiq) = 5 and £(Xn) = 11. Hence Jn 
can be eliminated by test (a) and L = {X i, X w }. 

5. Break Xiq into subproblems Ann and Aioo, which contain item 1, do not 
contain item 2, and do or do not contain item 3. We eliminate Aiqo by test 

(b) . Clearly problem X W i is solved by x\ = {1,3}, f{x\) = 8. So following 

(c) we set U = 8 and L = {A i}. 

6. Break Xqi into subproblems X n and A io- Since £{X n) > U, we eliminate 
^ooi by test (a). As for A io it is infeasible, and so elimnated by test (b). 

L is now empty and so we are done. The optimal solution x\ = {1,3}. 




^010 -^011 ^100 ^101 

infeasible t = 9 infeasible opt= 8 



Note 1. There is a trade off between the number of iterations that the method 
takes and the effort we put into calculating the bounds. 

For example, in calculating £(Xq) we could notice that for items 2, 3 and 4 the 
value of Wi/vi is 6/5, 3/4, 1/2. So to fill the backpack to total value 9, without 
using item 1, requires a weight of at least 1 + 3 + 3(6/5) = 38/5, and we can put 
£(X Q ) = 7.6. Similarly, £{X X ) = 5 + 1 + 2(3/4) = 7.5. 

By a similar calculation we have ^(Aoi) = 6 + 1 + 2(3/4) = 8.5. So at after 
Step 5 we could eliminate Aoi and so Step 6 would be unnecessary. 

Note 2. Suppose we want all optimal solutions. In this case, we replace the 
fathom test (a) with £(Y) > U and change fathom test (c) so that if £{Y) = U we 
add additional incumbent solutions to the collection, and if £{Y) < U we throw 
away all current incumbent solutions and replace by new one(s). 



14 Integer Programming 



14.1 Dakin's method 

Consider the problem 

n 

minimize z = ^ Cj Xj 

3 = 1 

subject to 

1, . . . ,ra 
1, . . . ,n 

1,...,/ (I<n) 

If only some but not all of the variables (/ < n) are constrained to be integer- 
valued then we say the optimization problem is a mixed integer program. If 
all the variables are constrained to be integer- valued (/ = n) then it is said to be 
a pure integer program. 

Dakin's method applies to both pure and mixed forms of integer programs. 
It nicely combines the primal and dual simplex algorithm with Branch and Bound. 

Initialize step. Recall that the problem without the integer constraints is the 
linear programming relaxation. We use this as our method of generating 
lower bounds. 

We set U — oo and solve the LP relaxation of the problem with the primal 
simplex method. If optimal solution, say has Xj E Z, for j = 1, . . . ,/ then 
stop. This solution is optimal. Otherwise, pick a variable, say xj, that is meant 
to be integer but is not. Partition into two subsets by adding one or other of the 
constraints 

Xj < [x 3 \ or Xj > \xj] . 

Observe that Xi violates both of these constraints and that if x is the unique 
optimal solution to the LP relaxation then the optimal cost of each of the two 
subproblems will be strictly larger. 

Bound step. Solve the resulting LP relaxation of the problem with the new 
constraint. Use the dual simplex method, starting with the basic solution that 
was optimal before adding the new constraint. The dual simplex method works 
well for this task. Use the newest bound rule for greatest efficiency 



3 = 1 

Xj>0, j = 

Xj e z , j = 
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Fathoming. The fathoming tests become: 

1. If new optimal solution has £(Y) > U . 

2. If dual simplex implies that there are no feasible solutions. 

3. If the optimal solution is feasible for the original problem, i.e., has integer 
values for xi, . . . , xj and if £(Y) < U, then reset U = £(Y) and store x as the 
incumbent solution. 

14.2 Example of Dakin's method 

Consider the pure integer programming problem 

minimize x\ — 2x2 , 
subject to — 4xi + 6x2 < 9 , x\ + X2 < 4 , > , Xi, X2 E Z . 

1. Set U = 00. 

2. Solve the LP relaxation to obtain x 1 = (1.5, 2.5) with optimal cost —3.5. 

3. Create two subproblems by adding the constraints X2 > 3 (subproblem Pi) or 
X2 < 2 (subproblem P 2 ). 

4. The LP relaxation of Pi is infeasible and we can eliminate this subset of possible 
solutions. 

5. The optimal solution to P2 is x 2 = (0.75, 2), with optimal cost of —3.25. 

6. Partition P 2 into two subproblems according to the additional constraint X\ > 
1 (subproblem P 3 ) or x\ < (subproblem P4). 

7. The optimal solution to P3 is x 3 = (1,2) which is integer and therefore er 
record this as the best solution so far and set U — — 3. 

8. The optimal solution to P4 is (0, 1.5) with optimal cost —3 > U. So delete P4. 

9. There are no more unfathomed subproblems so we stop with optimal solution 
x 3 = (1,2). 
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14.3 Decision variables 

We have seen pure and mixed integer programming problems. A further class 
of practically useful programs arises from the constraints that the variables may 
take one of the two values or 1. We met one of these in Section 13.3. Such prob- 
lems are called binary, or zero-one programs. Their interpretation as decision 
variables can capture many modelling questions. We give some examples. 

Binary choice, the 0—1 knapsack problem 

We have n items with item j having a size Wj and value v j . Given a bound K on 
the overall size of our knapsack, how should we select the items to maximize the 
total value? 

n n 

maximize VjXj , subject to WjXj < K , Xj G {0, 1} , j — 1, . . . , n . 
Contingent decisions 

A typical feature of discrete problems is that the variables are dependent in that 
decision x can be made only if y is also made. We can capture this dependence 
by the constraint x < y. So that if y = then x = 0. 

Example: the facility location problem 

Suppose that there are n potential facilities and that it costs Cj to build facility 
j — 1, . . . , n. There are m clients wishing to use one of the facilities and suppose 
that it costs dij for client i = 1 , . . . , m to use facility j . 



Decision variables 
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The facility location problem is to choose which facilities to build in order 
to minimize the total cost of construction and use. We can formulate the problem 
as follows. 



minimize Cjyj + dijXij 

j=l i=l j = l 

subject to 

n 

^ Xjj - 1 , z=l, ...,ra 
3=1 

Xij <yj, i = 1, • • • ,m, j = 1, . . . ,n 
a;ij,2/j G {0, 1} , j = 1, . . . ,n. 

Note that feasible solutions satisfy a^j = for all % = 1 , . . . , m whenever 
Uj = 0. So that clients do not attempt to use a facility that is not constructed. 

Relations between variables 

Consider constraints of the form 

n n 

5^Xj<1, or 5^ = 1, 

3=1 3=1 

where all variables are binary. The first implies that at most one of the variables 
can take the value 1. The second implies that precisely one of the variables must 
be 1. 

Restricted ranges of values 

Suppose a variable x is restricted to the values {a±, . . . , a m }. Introduce m binary 
variables yj, j = 1, . . . , m together with the constraints 



m m 

x = ^2 a jV3i X^" = 1 ' 2/je{0,l} 



15 Travelling Salesman Problem 

15.1 Categories of algorithms 

Given an undirected graph G = (TV, A) consisting of n nodes and m arcs together 
with costs Cij for each arc G A, the travelling salesman problem (TSP) 

is to find a tour of minimum cost. 

1. Exact algorithms are guaranteed to find an optimal solution but may take 
an exponential number of iterations. An exact algorithm for TSP is to write 
it as an ILP and solve it using branch and bound. 

2. Approximation algorithms have polynomial worst-case time complexity, 
supplying a suboptimal solution with a guaranteed bound on the degree of 
suboptimality. We shall look at such an algorithm for TSP, based on the 
minimum spanning tree. 

3. Heuristic algorithms supply suboptimal solutions without any bound on 
their quality. They do not necessarily provide polynomial running times, but 
empirically they often provide a successful tradeoff between optimality and 
speed. We look at two approaches that have been used for TSP: local search 
and simulated annealing. 

15.2 Exact methods 

Set Xij — 1 or as 6 A is or is not present in the tour. Define 

5(S) = {(i,j)eA :ieS,j#S}. 
For a tour there must be two arcs incident to every node so 

=2, ieN. (15.1) 

(ij)es({i}) 

Furthermore, for any partition of the nodes into subsets S and N\S there must 
be at least two edges connecting S and N\S. So we must also have 

^2 x ij > 2 > for a11 S CN 7 such that S ^ or N . (15.2) 
The so-called cutset formulation of the TSP is therefore 



59 



Polynomial formulation of TSP 
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minimize 2^ 

CijXij 

(i,j)eA 

subject to (15.1), (15.2) and Xij E {0, 1}. Notice that we have exponentially many 
constraints, since there are 2 n — 2 constraints of type (15.2). 
Alternatively, we can replace (15.2) by 

X H ^ \ S \ - X ' for all 5 C TV such that S ^ or N . (15.3) 

This constraint ensures there is no cycle involving less than all n nodes. Again 
we have an exponential number of constraints. This is called the subtour elimi- 
nation formulation of the TSP. The LP relaxations of these two formulations 
have the same feasible sets (though this is not obvious). 

15.3 Polynomial formulation of TSP 

Think of the undirected formulation of the TSP. The the salesman must on leaving 
a city he must next visit one and only one city, and, similarly, on arriving at a 
city he must have come from one and only one city. Therefore we must have 

Xij = l, i = 0, 1, . . . ,n - 1 (15.4) 
x ij = h j = 0, 1, . . . ,n - 1 . (15.5) 

i:(i,j)EA 

These constraints are not sufficient to ensure that the solutions do not consist 
of several subtours such as is shown here. 









2) 










5) 





Consider a tour so = 0, si, S2, . . . , s n -i- Let t{ be the position in the tour at 
which city i is visited. So, we have to = 0, t Sl = 1, and in general, 
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We require that if Xij = 1 then 



tj=U + l. 



Also, ti is an integer between and n 



1. Hence 




U - (n 
U + l 



1) if 
if 







1 . 



These constraints can be written as 



tj > U + 1 - n(l - x^) , i > 0, j > 1, z 7^ j . 



(15.6) 



It turns out that these constraints also rule out subtours. To see this, suppose 
we have a solution which satisfies these constraints and consists of two or more 
subtours. Consider the subtour that does not include city 0, and suppose it has 
r > 2 arcs. Summing the constraints over the arcs of this subtours leads to the 
condition 



and hence there can only be a single tour visiting all the cities. 

Thus the TSP can be formulated as an ILP in n 2 +n variables and 2n+n(n — 1) 
constraints. Namely, 



subject to (15.4), (15.5), (15.6), x^ G {0, 1}, t = and U G {0, 1,2,.. .}. 

15.4 Solution using branch and bound 

Notice that by dropping the constraints establishing the lack of subtours we are 
left with an assignment problem, which can be efficiently solved by the network 
simplex to provide a lower bound on the optimal solution. 

We need not worry about the relaxation to non-integral solutions since the 
network simplex algorithm will always find an integer solution. Thus we consider 



minimize 2_^ Ci J Xi J > subject to (15.4), (15.5) and x^ G {0, 1}. 



i,3 

If the optimal solution corresponds to a tour visiting all the cities then it is 
optimal for the original travelling salesman problem. 

If not, we continue with a branch and bound algorithm, using a branching rule 
that breaks the problem in two by an additional constraint of the form x^ = 0. 



> r 



minimize 





Approximation algorithm for the TSP 
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Think of this as re-setting a cost, cij = oo. The addition of such a constraint leaves 
us with a valid travelling salesman problem and a valid assignment problem which 
provides a corresponding lower bound. 

A natural way to select an Xij = constraint is to choose one or more of the 
subtours and eliminate one of their arcs. 

If the current assignment problem has a unique optimal solution, this solution 
becomes infeasible with the addition of a constraint during branching. Hence, the 
optimal cost of each subproblem is strictly larger, and increasing lower bounds 
are obtained. 

15.5 Approximation algorithm for the TSP 

Definition 15.1 An e- approximation algorithm for a minimization problem 
with optimal cost Z opt runs in polynomial time and returns a feasible solution with 
cost Z app , such that 



It is usually difficult to determine approximation algorithms for any e > and 
we shall develop one for the TSP only for e = 1, and when the costs satisfy 
the triangle inequality. 

Consider the undirected TSP with costs satisfying 



Now suppose that M is the cost of the minimal spanning tree. This can be 
obtained easily using Prim's greedy algorithm. Consider any starting node and 
traverse the minimal spanning tree to visit all the nodes. This uses each arc of 
the spanning tree exactly twice, with total cost 2M. 

This path can be converted into a tour visiting all the cities by skipping any 
intermediate node that has already been visited. By the triangle inequality, the 
resulting tour will have cost bounded above by 2M. Also, every tour contains a 
spanning tree (since dropping any one arc leaves a spanning tree) and so has cost 
at least M. 

Thus a straight-forward algorithm based on the minimal spanning gives 




Cij < c ik + c fc j, for all i, j, k . 



Z : 



app — 



<2M < 2Z, 



opt • 



It is an approximation algorithm with e = 1. Observe the essential importance 
played by the triangle inequality. 



16 Heuristic Algorithms 



16.1 Heuristics for the TSP 

Nearest neighbour heuristic. Start at some city and then visit the nearest 
city. Continue to visit the nearest city that has not yet been visited, continuing 
until a tour is complete. 

Although usually rather bad, such tours may only contain a few severe mis- 
takes. They can serve as good starts for local search methods. 



Cheapest insertion heuristic. This is also a greedy algorithm. Start with a 
single node and then, one by one, add the node whose insertion makes the smallest 
increase to the length of the tour. 



Furthest insertion heuristic. Insert the node whose minimal distance to the 
exisiting tour node is greatest. The idea is to determine the overall layout of the 
tour early in the process. 



Savings heuristic. Rank the arcs in ascending order of cost. Add the arcs in 
this order, so long as they do not violate any constraints, and until all cities have 
been visited. 






OptLrrkat Tour 



16.2 Neighbourhood search 

Consider the general problem 

minimize c(x), subject to x G X. 

Suppose that for any point x G X we have a set of 'neighbouring points', N(x) C 
X. The basic approach of local search is as follows 

1. Select some x G X. 
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2. Evaluate c(x). 

3. Pick some y G N(x) and evaluate c(y). 

If c(?/) < c(x) then select y as new value for x and return to step 2. 

If there is no such y G N(x) with c(y) < c(x) then stop with solution x. 

Remarks 

1. A specific implementation must specify, in a problem-dependent way: 

(a) the neighbourhood sets, N(x), for all x G X; 

(b) the procedure for selecting y G N(x). 

2. There are various ways to modify local search. 

(a) We might use some rule to guess a good starting point or try multiple 
starting points. 

(b) We might choose the best neighbour y G N(x) with least value of c(y) not 
just the first y that improves the solution. 

(c) We might choose the best neighbour amongst the first r considered. 

3. The simplex algorithm for linear programs is a local search method. We can 
say that two basic feasible solutions are neighbours if they are connected by 
an edge of the constraint set. 

In linear programming any local optimum is the global optimum. 

16.3 Neighbourhood search methods for TSP 

Consider the TSP. By a feasible solution x, we mean the indicator function for 
the arcs in some tour of the network. 

There is a fairly natural family of neighbourhoods for any tour x generated 
by the operation of removing any k > 2 arcs from the tour and replacing them 
with k new arcs that also make a new tour. For example, when k = 2 (known as 
20PT) each tour has 0(n 2 ) neighbours. For k = 3 there are 0(n 3 ) neighbours 
for each tour x. 
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Note that it only takes time that is 0(1) to compute the change in cost between 
neighbouring tours. 

Empirical evidence is that 30PT performs better than 20PT, but there is 
little further gain in taking k > 3. 

In general, there is trade-off of solution quality and speed. The larger the 
neighbourhoods N(x) the fewer local minima there are, and the better the solu- 
tion. However, more work must be done per iteration so the method is slower for 
larger neighbourhoods. 

In practice, we fix on a certain neighbourhood size but then repeat the algo- 
rithm with a variety of starting values. 



Example TSP using 20PT 

Suppose we have a distance matrix 
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A feasible solution is a cycle that visits all the nodes (without re-using the arcs). 
Here are the 4! = 24 feasible tours and costs c(x) 



ABCDE ( 5) 
ABCED ( 6) 
ABDCE ( 6) 
ABDEC (10) 
ABECD (10) 
ABEDC (17) 



ACBDE ( 6) 
ACBED (12) 
ACDBE (10) 
ACDEB ( 6) 
ACEBD ( 0) 
ACEDB (12) 



ADBCE (10) 
ADBEC (20) 
ADCBE (17) 
ADCEB ( 9) 
ADEBC (10) 
ADECB (17) 



AEBCD ( 6) 
AEBDC (12) 
AECBD (12) 
AECDB (17) 
AEDBC (17) 
AEDCB (20) 



So, ACEBD is global optimum but we can get stuck in ABCDE since none of its 
neighbours (under 20PT) is better. 



16.4 Simulated annealing 

In this method we try to overcome difficulties of getting stuck in potentially poor 
local minima by permitting the algorithm to jump out of them. 

The basic idea is to allow up jumps to worse neighbours in initial phase but 
get gradually more reluctant to permit up jumps. 

The simulated annealing method permits a move from x to y G N(x) with 
probability 

c(y) - c(x)]\ 



p xy = min 1 , exp 



Genetic algorithms 
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where T starts large and decreases with each iteration. It can be shown, under 
suitable conditions, that if you start with T large enough and decrease it slowly 
enough then 

lim P(x(t) is optimal ) = 1 . 

t — >oo 

As motivation for this claim, imagine that all solutions have k neighbours, and 
at each step of the algorithm one of these neigbours is chosen at random. Then 
x moves next to its neightbour y with probability P xy = p xy /k. By checking that 
the 'detailed balance' of 7r(x)P xy = 7r(y)P yx holds we have that the stationary 
distribution of this Markov chain is 

e -c(x)/T 

7r( x ) = — , where A = ^ e ~ c ( z )/ T . 

zex 

Let Y be the set of optimal solutions. In this stationary distribution tt(Y)/(1 — 
tt(Y)) -> oo as T -> 0. 

A common temperature schedule is to let T decrease with iteration number, 
t, according to 

V J log* 



16.5 Genetic algorithms 

Genetic algorithms can be applied to many problems, are often easy to imple- 
ment, but may get stuck in a local optimum. We illustrate the basic steps for the 
TSP. 



Create a random initial state. This is a population of tours, each of which 
is list of cities, analagous to a list of chromosomes. 



Evaluate fitness. A value for fitness is assigned to each solution, e.g., the 
length of the tour. 



Reproduce. Those chromosomes with a higher fitness value are more likely to 
reproduce offspring E.g. 'greedy crossover' selects the first city of one parent, 
compares the cities leaving that city in both parents, and chooses the closer one 
to extend the tour. If one city has already appeared in the tour, choose the other 
city. If both cities have appeared, randomly select a non-selected city. 



Mutate. Randomly swap over a pair of cities. 



17 Two-person Zero-sum Games 



17.1 Terminology 

Game theory is the study of multi-player decision problems. The common theme 
is conflicts of interest between the different players. It assumes that each player 
plays the 'best way' he can. This may not happen in practice. Game-theoretic 
analysis is often more descriptive than prescriptive. 

• Players. Labelled 1, 2,3,.. ., or I, II, III, . . . 

• Moves. A move is either a decision by a player or the outcome of a chance 
event. 

• Game. A game is a sequence of moves, some of which may be simultaneous. 

• Payoffs. At the end of a game each player receives a return. The payoff to 
each player is a real number. If a move has a random outcome we use an 
expected payoff. 

• Strategy. A strategy is a description of the decisions that a player will make 
at all possible situations that can arise in the game. 

• Zero-sum. The game is said to be zero-sum if the sum of the players' payoffs 
is always zero. 

• Perfect information. A game is said to have perfect information if at 

every move in the game all players know all the moves that have already been 
made (including any random outcomes.) 

17.2 Two-person zero-sum games 

We begin with zero-sum games between two players, labelled I and II. Each player 
has a finite collection of pure strategies. Player I has strategies Ii, I2, . . . , I n 
and player II has strategies Hi, II2, . . . , H m - 

Let eij denote the (expected) payoff to player I when he uses strategy Ii and 
player II uses strategy ILy. The normal form representation of the game is 
given by the matrix of payoffs (eij). 

This representation is in terms of strategies. It does not include detailed 
information about the sequences of moves or whether or not the players have 
perfect information. 
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17.3 Maximin criterion 

In a non-zero-sum game, we must record both players' payoffs, say ei(i,j) and 
^2 So in a zero-sum game ei(i,j) = — e2(i,j) = e^. Player I wins what 
player II loses. Thus when player II tries to maximize his payoff he is also trying 
to minimize the payoff of player I. This means that player I should look at 
the payoff he would receive if he plays strategy Ii, i.e., mim, e^-, and choose the 
strategy which has the largest of these minimum payoffs. This is known as the 
maximin criterion. 

Using this criterion, player I can guarantee that his payoff is at least, vl, the 
lower value of the game, where 

vl = max min . 

i 3 

Similarly, player II can guarantee that player I's payoff is no more than, vjj, 
the upper value of the game, 

vjj = min max e^- . 

3 i 



Example (a) 



Ii 

h 
h 

max using 11 j 



III 


II 2 


Us 


II 4 


II 5 


He 


1 


1 


-1 


1 


1 


-1 


-1 


1 


-1 


-1 


1 


-1 


1 


1 


1 


-1 


-1 


-1 


1 


1 


1 


1 


1 


-1 



min using Ij 

-1 
-1 
-1 



Thus, vl = max{— 1, —1, —1} = —1 and vjj = min{l, 1, 1, 1, 1, —1} = —1. When, 
as in this example, we have v l = vjj for a pair of pure strategies, there is said 
to be a saddle point solution. 

Example (b) 



Ii 
I 2 

max using IL, 



III 


H 2 


min using 1^ 







-1 


1/2 


-i) 


-1/2 











Thus, vl = max{ — 1, —1/2} = —1/2 and v\j = min{0, 0} = 0. 
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17.4 Mixed strategies 

In Example (b) vl is strictly less than vjj- Suppose we enlarge the set of pos- 
sible strategies by randomization, allowing player I to choose strategy 1^ with 
probability pi and player II to choose strategy Ilj with probability qj. We say 
that player I adopts the strategy p = (pi,P2, • • • ,Pn) and II adopts the strategy 
q = (qi, Q2, • • • , Qm)- The expected payoff to I is 

n m 

e(p,q) = J2J2 Pie v q i- 

i=l j = l 

Suppose that in Example (b) player I takes p = (p, 1 — p) and player II takes 
q = (q, 1 — q). Then the expected payoff to player I is 

e(p, q) = Opxqi - lpiq 2 - \piqi + §Piq2 
= -Pi ~ \qi + 

= f(Pi-i)fe-i)-i- 

Define 

i?|f = max min e(p, q) and Vjj = min max e(p, q) . 

p q q p 

Notice that if I plays p* = (1/3, 2/3) and II plays q* = (2/3, 1/3) then 
e {p* ■> q) = ~~ 1/3 for all q , and e(p 7 <?*) = — 1/3 for all p. 

It is clear that vff < vff . Playing p* guarantees I at least —1/3, so vff > —1/3. 
If II plays q* then I will be restricted to no more than —1/3, so vff < —1/3. Hence, 
we must have 



17.5 Minimax theorem 

Theorem 17.1 (Minimax theorem) Consider a two-person zero-sum game in 
which I has n strategies and II has m strategies (both finite). Then 

vff = max min e(p, q) = min max e(p, q) = v\j . 

p q q p 

If p* and q* achieve the maximin criterion of the theorem then 

/ * *\ M M 

e{P ,q ) = v L =v u = v. 

We say that v is the value of the game and that the value together with the 
optimal strategies, p* and q* are the solution to the game. 
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17.6 Dominating strategies 

Consider the payoff matrix 





III 


n 2 


ii 3 


II 4 


II 
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6 


4 
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3 


4 


h 




4 


5 


5 



No matter which strategy player I follows his payoff is always the same or less 
under Hi than II4. Thus, II will always do as well to use Hi rather than II4. 
We say that Hi dominates II4, and we may remove it from the strategy set. 
Similarly, II 2 dominates II3. So the game reduces to 





III 


II 2 


II 




5 


I 2 


(i 


2 


Is 




4 



But now we may iterate this procedure. Since Ii dominates both I 2 and I3 in 
the reduced game, though not in the original game, we have 

Hi II 2 
Ii ( 4 5 ) 

Lemma 17.1 If a dominated strategy is removed from a game then the solution 
of the reduced game is a solution of the original game. 

Suppose that I 2 dominates Ii in the original n x m game and that p*, q* and 
v is a solution to the reduced game when Ii is omitted. 

Then e(p*,q) > v for all q and e(p,q*) < v for all p = (0,p 2 , • • • ,Pn)- We need 
to show that 

e(p*, (?) > i> , for all q , and e(p,q*)<v, for all p. 
Take any p — (pi,p2, ■ ■ ■ ,Pn) then 

n m m n m 

e(p, q*) = J2J2 p ' e v q *J =Pl Yl ei ^j + EE^ e ^l 

i=l j=l j=l i=2 j = l 

m n m 

< Pi ^2 &2 i q *j + EE ftei i ? i = e ^'' q *"> 

3 = 1 «=2 3=1 

where p' = (0,pi +P2,P3, • • • ,Pn) is a strategy in the reduced game. Thus 

e(p,q*) < e{p ! ,q*) <v, for all p = (pi,p 2 , ■ ■ ■ ,p n ) ■ 
Simlarly, we can show e(p*,q) > v for all q. ■ 
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18 Solution of Two-person Games 

18.1 Equilibrium pairs 

A pair of strategies p* and q* is an equilibrium pair if for any p and q 

e{p,q*) < e{p*,q*) < e(p*,q) . 
It is possible for there to be more than one equilibrium pair. In the game 

Ii 
h 

both (Ii, Hi) and (Ii , II3) are equilibrium pairs. Indeed, p* = (1,0) and q* = 
(q, 0, 1 — q) are equilibrium pairs for any < q < 1. But in all such cases 
e(p*,q*) = 1. 

Lemma 18.1 If(p, q) and (p', q') are both equilibrium pairs then e(p, q) = e(p f , q'). 

Proof. Since (p, q) and (p f , q') are both equilibrium pairs, we have 

e(p', q) < e(p, q) < e(p, ^) and e(p, g ; ) < e(p', 9') < e(p', q) . 
Together, these imply e(p,q) = e(p',q'). ■ 

Theorem 18.1 A pair of strategies {p*,q*) in a two-person zero-sum game is an 
equilibrium pair if and only if (ft* ,q* ,e(p* ,(?*)) is a solution to the game. 

Proof. If (p*,q*) is an equilibrium pair then 



maxe(p, g*) < e(p*,q*) < min e(p* 7 q) 
p q 



Then 



Vjj = minmaxe(p, q) < maxe(p, q*) < e(p*,q*) 



p 



M 



< min e(p*,q) < maxmine(p, q) = v L 

q p q 

So, since vff < vff we must have vf? = v{f = e(p*,q*) so that (p* , q* , e(p* , q*)) 
is a solution of the game. 
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Conversely, if (p* , q*, e(p*, q*)) is a solution of the game then 
e(p, q*) < maxe(p, q*) = minmax e(p, q) = e(p*,q*) 

P Q P 

= maxmin e(p, q) = min e(p*,q) < e(p* ,q) . 

p q q 

Hence, {p* 7 q*) is an equilibrium pair. 

18.2 Solving two-person zero-sum games 

We want to find probability vectors p*, q* and a v such that 

e(p, q*) < e{p*,q*) = v < e(p* : q) 

for any p and q. The first inequality implies that 

m 

e(Ij, q*) = ^2 e ijQj - v ' for alH = 1, 2, . . . , n . 

3=1 

Player II chooses q* so as to make v as small as possible. So his problem is 



m m 



minimize < v : ZjjQj < v, qj > 0, ^^Qj — 1 
1 i=i i=i 

Let, Qj = qj/v then Qi + Q2 H h Q™ = YlJLi Qj/ V = Thus, assuming 

i> > 0, minimizing 1; is equivalent to maximizing 1/w so that the final problem is 

maximize Qi + Q 2 H h Q m 

subject to CijQj < 1 , i — 1, • • • , n , and > , j = 1, . . . m . 

3=1 

To ensure n>0we can add a constant to every payoff. This will not change the 
optimal strategy only the value. Consider the dual problem given by 

minimize Pi + P2 + • • • + P n 

n 

subject to P%e%j > 1 , j — 1, 2, . . . , m , and Pj > , z = 1, 2, . . . , n . 

i=i 

Interpret this as Pi = Pi/v for i = 1, 2, ... , n and rewrite as 

f n n 

maximize <N : ^ 

PiCjj > V 7 Pi > 0, — 1 



i=l i=l 
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Thus we can solve the game by solving the primal (or dual) LP. Solving the 
primal gives the value of the game and player IPs optimal strategy q* while the 
dual problem gives player Fs optimal strategy p* . 

18.3 Two-person non-zero-sum games 

The players are not completely antagonistic to one another and might both be 
happier with one outcome than another. For non-zero-sum games, (a) a maximin 
pair is not necessarily an equilibrium pair and vice versa; (b) equilibrium pairs 
don't necessarily have the same payoffs; (c) there is no obvious solution concept 
for the game. 

Write ei(-, •) for player Fs payoffs and e2(-, •) for player IFs payoffs. A pair of 
strategies (p*,q*) is an equilibrium pair if for any p and q 

ei(p,Q*) < ei(p*,g*); e 2 (p*,q) < e 2 (p*,q*) . 
Example: Prisoner's Dilemma 

Don't confess Confess 
Don't confess / (5,5) (0, 10) \ 

Confess ^ (10,0) (1,1) ) 

The equilibrium pair is (Confess, Confess) with payoffs (1,1). However this is 
worse for both players than (5, 5), where both players don't confess. The 'confess' 
strategy dominates the 'don't confess' strategy yet 'most people' would regard 
(Don't confess, Don't confess) as the 'best' solution. 

Example: Coordination game 

Opera Football 

Opera / (1,4) (0,0) \ 

Football V (0,0) (4,1) J 

There are three equilibrium pairs: (Opera, Opera) with payoff (1,4), (Football, 
Football) with payoff (4, 1) and a third one consisting of the mixed strategies 
(1/5,4/5) and (4/5,1/5) which gives payoffs of (4/5,4/5). The difficulty is to 
persuade the other player to do the same as you. Compare this with flipping 
a coin and both going to the opera or both to football and sharing the payoffs 
evenly; this requires cooperation. 

Theorem 18.2 (Nash' Theorem) Any two-person game (zero-sum or non-zero- 
sum) with a finite number of pure strategies has at least one equilibrium pair. 
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Proof. Set P = {p : pi > 0, J2iPi = 1}> Q = U '■ Qj > °> HjQj = 1} and 
define 

# = : peP, q eQ}. 

Then 5 is closed, bounded and convex. For any p G P and q G Q define 

Ci(p, g) = max{0, ei(Ii, q) - ei(p,q)} , i = 1, 2, . . . , n 

as the amount I gets extra by playing Ij rather than p against q. Let 

dj(p,q) = max{0,e 2 (p,IIj) - e 2 {p 1 q)} , j = 1,2, . . . ,ra 

be the amount II gets extra by playing ILy rather than g against p. 
Define the function f(p,q) = (p',q f ), where 

, Pi + Ci{p,q) , qj+dj{p,q) 

Pi — ; — 7 \ ano - ^7 — 



for all i = 1, 2, . . . , n, j = 1, 2, . . . , m. Then / is continuous so, by the Brouwer 
Fixed Point theorem, there is a fixed point (p*,q*) such that 

/(pW) = (pW)- 

Observe that we cannot have e\(Ii,q*) > ei(p*, g*) for all « = 1, 2, . . . , n since 
that would imply 

n n 

ei(p*,?*) = X>* e i( 7 ^*) > $>?ei(p*,0 = 

i=l i=l 

Thus for some i we have that 

But since (p* ,q*) is a fixed point of / we have 

Pi +Ci(p*,q*) 



* 

Pi 



1 + Ci'ijP*A*) 
and for the choice of i with Ci(p*,q*) = we see that 

^ Ci/(p*,g*) = 0. 
i'=i 

Thus, for alH' = 1, 2, . . . , n we have that c*> (p*, <?*) = and so 
ei(p*,^*) > ei(/i,5*), i= l,2,...,n, 

and hence 

e i(p*5 Q*) > e i(p ? 9*) ? for all p£P. 

A similar argument shows that e2(p*, <?*) > e2{p*, q) for all q G Q and so the fixed 
point {p*,q*) is an equilibrium pair. ■ 
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19.1 Finding equilibrium pairs 

Example. Consider 

Hi H 2 

h / (3,2) (2,1) \ 

h \ (°> 3 ) ( 4 > 4 ) J 

Suppose we fix q and find p to maximizes ei(p,q). If q is part of an equilibrium 
pair we have constructed it's partner strategy for player I. Then 

ei(p, q) = Pi (5^i - 2) + 4 - 4gi . 
where p = (pi,P2) and g = (51, #2)- Thus the maximizing pi is given by 

10 if q 1 < 2/5 

any < pi < 1 if gi =2/5 
1 if 51 > 2/5. 

Similarly, for player i7, we consider e2(p, q) = <?i(2pi — 1) + (4 — 3pi) . and 
find the maximizing q\ as 

10 ifpi<l/2 
any < qi < 1 if pi = 1/2 
1 ifpi>l/2. 

Thus we look for the mutual solutions to these two simultaneous maximization 
problems to find the three equilibrium pairs: 

1. pi = 0,gi = 0, corresponding to (I2, II2) with payoffs (4,4); 

2. pi — l,qi = 1, corresponding to (Ii, Hi) with payoffs (3,2); 

3. pi = l/2,gi = 2/5, corresponding to ((1/2, 1/2), (2/5, 3/5)) with payoffs 
(2.4,2.5). 

Notice that the payoffs differ, but that given an equilibrium pair neither player 
has any incentive to alter his strategy. 
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19.2 Cooperative games 

Consider the game 

Hi H 2 

11 ( (0,0) (1,1) \ 

12 V (3' 1 ) (1.3) J 

For cooperative games, the payoff region is larger than in the corresponding 
non-cooperative game. If (iti,i>i) and (^2,^2) are payoffs in the non-cooperative 
game, then in the cooperative game the strategy which chooses strategies giving 
payoffs (ui,vi) and (^2,^2) with probabilities (3 and (1 — (3) has expected payoff 

(3(ui,vi) + (1 - (3)(u2,v 2 ) ■ 

Thus the payoff region, R, for the cooperative game is the convex hull of the 
payoff region for the non-cooperative game, i.e., it is the smallest convex region 
that covers the payoff region for the non-cooperative game. In fact, R is most 
simply obtained by constructing the convex hull of the points corresponding to 
each player adopting one of his pure strategies. Points corresponding to the use of 
pure strategies may be in the interior of the convex hull; e.g., the pair of strategies 
(Ii, II2) in the game above. 

Notice the different way that randomization is used. In a non-cooperative 
game each player randomizes over his choice of strategy. This is done to confuse 
the opponent. In a cooperative game players jointly randomize over pairs of strate- 
gies, which may themselves be mixed strategies for the game. Randomization it 
is used to average between possible outcomes. 

19.3 Bargaining 

In a cooperative setting, there is a preplay stage or negotiating stage where the 
players decide on the strategies to be used. Of the possible payoffs, which are the 
players likely to agree on? We say that a pair of payoffs (it, v) in a cooperative 
game is jointly dominated by (it', 1/) if 

u' > it, v' > v and {u ,v') 7^ (it, i>) • 

A pair of payoffs (it, v) is said to be Pareto optimal if it is not jointly dominated. 
Certainly, the players will only be interested in agreeing on Pareto optimal payoffs 
since otherwise there is a payuoff such that both can do as well and one can do 
better. 

Players I and II can always guarantee themselves payoffs of at least 



v\ = max min e\ (p, q) and v\\ = max min e 2 (p, q) 

p£P qeQ q€Q p£P 
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respectively. So, we would expect the solution of a cooperative game to lie within 
the bargaining set (also called the negotiation set) given by 

B — {(u, v) : u > vi, v > vu, and («, v) is Pareto optimal in R} . 

How ought the players agree which point of B is to be used? 

Note that we must not make inter-player comparisons of payoffs, which we are 
supposing are measured by their own utilities, not necessarily on the same scale. 
We should not conclude that I prefers (4, 1) to (1, 10) by less than II prefers (1, 10) 
to (4,1). 

Status quo point 

Nash suggested that within the set of feasible payoffs, R, jointly available to the 
players there is also a special payoff, (uq,vo) G R called the status quo point, 
which is the outcome the players will receive if they can't agree by a process of 
negotiation. 

An arbitration procedure, is defined as a map from the status quo point 
and the region R to some point (u*,v*) G R 

V((u o ,v ),R) = («*>*). 

19.4 Nash bargaining axioms 

1. feasibility. (u*,v*) G R. 

2. at least as good as status quo u* > uo, v* > vq. 

3. Pareto optimality. If (u, v) G R and u > u*, v > v* then u = u*, v = v*. 

4. symmetry. If R is symmetric, so that if (u, v) G R (v, u) G R } and if 
Uq — Vo, then u* = v* . 

5. invariance under linear transformations. Let R' be obtained from R by 
the transformation 

v! = au + b , v' = cv + d , a, c > . 

Then if (u*,v*) = ^{(u ,v ),R) then qj((au + b, cv + d), R') = {au* + b,cv* + 
d). 

6. independence of irrelevant alternatives. If R' is a subset of R, ^((uo, vo),R) = 
(u*,v*) and (u*,v*) G R', then we must also have ty((u ,v ), R f ) = (u*,v*). 
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19.5 Nash's arbitration procedure 

For (u, v) G R with u > «o, v > vq define the function 

f(u,v) = (u - uo)(v - v ) . 

If there exists points (u,v) E R with u > uo, v > vq then / attains a unique 
maximum at some point (u*,v*) G R. Define 

V((u ,v ),R) = (u*,v*). 

Nash showed that ^ is the only function that satisfies the axioms (1-6). 

19.6 Maximin bargaining solutions 

Nash's result specifies the arbitration procedure ^ for a given status quo point 
(«o 5 ^o)- A natural choice is to take the maximin values (vi, vu) as the status quo 
point. This gives the Maximin bargaining solution. 

Example. Consider the two-person non-zero sum game with payoffs 

Hi H 2 

Ix / (1,2) (8,3) \ 

Consider the two zero-sum games for each player separately. Using the LP 
approach we find the maximin values of v\ — 3^ and vu = 2|. 
The negotiation set of Pareto optimal points is given by 

B = {(u,v) : u + = 20, 4 < u < 8} . 

Thus we wish to maximize over B 

f(u, v) = (u - u )(v - Vq) = ( u - 3- J ( v - 2^ ) 



37 V 2 



u — 3- 1 | 5 u — 2- 

3 4 2 



This gives (u*,v*) = (6f,3|) as the unique solution to the Nash arbitration 
procedure for the maximin bargaining solution. 
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20.1 Characteristic function 



We can extend our definition of (Nash) equilibrium pairs to n-person games. The 
n- tuple of strategies p*,2?2, ... where player i plays mixed strategy p* is an 
equilibrium n-tuple if for all other strategies, pi,P2, ■ ■ ■ ,p n , 

ei(p$,P2, . . . ,p*, . . . > ei(pf,P2, • • • :Pi, ■ ■ ■ :Pn) : i = 1, 2, . . . , TO . 

If there are n > 2 players in the game then there might be cooperation between 
some, but not necessarily all, of the players. We can ask which coalitions of 
players are likely to form and what are the relative bargaining strengths of the 
coalitions that do form. 

Label the players 1,2, ... ,n. A coalition of players, S, is then a subset of 
AT = {l,2,...,n}. The worst eventuality is that the rest of the players unite and 
form a single opposing coalition N \ S. This is then a 2-person non-cooperative 
game and we can calculate the maximum payoff that S can ensure for itself using 
the maximin criterion. 

Let v(S) denote the maximum value v(S) that coalition S can guarantee itself 
by coordinating the strategies of its members, no matter what the other players 
do. This is the called the characteristic function. By convention, we take 
v (0) = 0. The characteristic function measures the strengths of possible coalitions. 
Note that for any two disjoint sets S and T, we have superadditivity, i.e., 



20.2 Imputations 

Given that a coalition forms, how should v(S) be shared between its members? 
The distribution of individual rewards will affect whether any coalition is likely to 
form. Each individual will tend to join the coalition that offers him the greatest 
reward. 

An imputation in a n-person game with characteristic function v is a vector 
x = (xi,X2, • • • , x n ) satisfying 



v(SUT) >v(S)+v(T). 



n 



(1) 




Xi = v(N) , and (2) Xi > v({i}) for each i = 1,2,... 



n . 



i=l 



Think of x\ as player z's reward. Let E(v) be the set of imputations. 
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Suppose that x and y are imputations. Then we know that 

n n 

^x z = v{N) = ^2 Vi , 

i=l i=l 

and so if Xi > yi then there must be some j with yj > Xj. So everyone cannot 
be better off under x than under y. However, it is possible that all members of a 
particular coalition are better off under x than under y. 
Let y G E(v). We say that y dominates x over S if 

(3) yi > Xi for all z G S ; and (4) yi < v(S) . 

ies 

(4) ensures v(S) is large enough to pay its members the amounts in y. 



20.3 The core 

The core of a game with characteristic function v is the set, C(f), of all im- 
putations that are not dominated for any coalition. The idea is that only such 
imputations can persist in pre-game negotiations. 

Theorem 20.1 x is in the core if and only if 

n 

(5) ^2xi = v{N); (6) J2 Xi - v ( S "> forallScN. 

i=i ies 

Proof. Let x satisfy (5) and (6). Putting S = {i} for each i — 1, . . . ,n shows 
that x is an imputation. To show that it is not dominated, suppose, there is a 
coalition S with yi > Xi for all i G S. But using (6) 

which violates (4). 

Conversely, suppose that x is in the core. It is an imputation and so (5) must 
hold. Now suppose, if possible, that for some coalition S condition (6) doesn't 
hold so that X^gs 3 ^ < v(S). Define e by 

= v(S)-T,i£S x i >n 

151 
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Then the imputation 

( Xi + e i E S 

Vi = { „.„ , "W- "(g) -gjgg "(W) 

W W) + pn^ ^ s 

dominates £ on 5, contradicting the assumption that x is in the core. 



20.4 Oil market game 

Country 1 has oil which it can use to run its transportation networks at a profit 
of £a per barrel. Country 2 wants to buy oil for use in farming, for a profit of 
£b per barrel. Country 3 wants to buy oil for use in manufacturing, for a profit 
of £c per barrel. Assume a < b < c. The characteristic function is 



Coalition, S 


Characteristic function, v(S) 


0, {2}, {3}, {2, 3} 





{1} 


a 


{1,2} 


b 


{1,3}, {1,2,3} 


c 



Suppose x = (xi,X2,xs) is an imputation in the core. We must have 
^1+^2+^3= v(N) = v({l, 2, 3}) = c; 



and using £\ x ieS >v(S), 



x\ > a; X2 > 0; X3 > 0; 

X\ + X2 > b; X2 + x% > 0; X\ + x% > c. 

Thus X2 = 0, x\ + X3 = c and x\>b and so the core is given by 

C(v) = {(x, 0, c — x) : b < x < c} . 

The interpretation is that countries 1 and 3 form a coalition, with 1 selling oil 
to 3 for £x per barrel, which is at least £b per barrel (otherwise, country 1 would 
be better off selling to country 2) and at most £c per barrel, otherwise, country 
3 would pay more than it values the oil. 
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20.5 The nucleolus 

A major drawback of the core is that for many games it is empty. 

For any imputation x = (x±,X2, • • • , x n ) E E(v) and for any coalition S C N 
define 

x(S) = y^Xj. 
ies 

The quantity v(S) —x(S) measures the 'unhappiness' that coalition S feels for 
the imputation x. The larger this value the larger the difference between what 
the coalition could get and what it actually gets. 

Define 0(x) to be the vector of 2 n values taken by v(S) — x(S) as S varies 
across the possible coalitions arranged in decreasing order. We use 0(x) to order 
imputations x and y. 

Writing 0(x) = (0(x)i, 0(x) 2 , . . . , 0(x) n ) we say that 0(x) < 0(y) if 0(x)i < 
0(y)i or if 0(x) fc = 0(y) fc for k = 1, 2, . . . , i - 1 and 0(x), < 0(y),. 

The nucleolus is the smallest imputation under this ordering, written 

N(v) = {x G E(v) ; 0(x) < 0(y) for all y e E(v)} . 

It can be shown that the nucleolus always exists and is unique. Also, provided 
the core is non-empty the nucleolus lies within the core. To see this, let x be in 
the core. Then for any coalition S, v(S) —x(S) is zero or negative (by definition of 
the core). Thus all the entries of 0(x) for an x in the core are zero or negative and 
hence this property will have to hold for the minimal choice of 0(y) over choices 
of imputation y. But this means that such a minimizing imputation y is in the 
core. 

Thus the nucleolus is a natural interpretation for a fair division of the reward 
v(N). Consider again the oil market game. To construct the nucleolus we need 
only consider imputations in the core, C(v) = {(x, 0, c — x) : b < x < c}. 

Computing v(S) — x(S) for all possible coalitions gives 

v(0) - z(0) = v({l, 2}) - x{{1, 2}) = b-x 

v({l}) - x{{1}) = a-x v({2, 3}) - x{{2, 3}) = x - c 

v({2}) - x({2}) = v({l, 3}) - x({l, 3}) = 

^({3}) - x({3}) = x-c v({l, 2, 3}) - x({l, 2, 3}) = 

The largest nonzero element is either b — x or x — c. Thus we minimize the largest 
nonzero unhappiness by setting b {b + c)/2. Thus, the 

nucleolus is the imputation x = ((c + b)/2, 0, (c — b)/2) and 



0(x) = (o,0,0,0,i(6-c),i(6-c),i(6-c),a- \{b + c)) 



21 Shapley Value and Market Games 



21.1 Shapley value 

What might each player reasonably expect to receive as his share of the reward 
in a cooperative game? Shapley proposed three axioms that one might require 
for 4>i(v), player z's expected share in a game with characteristic function v. 

Shapley's axioms 

1. 4>i(v) should be independent of the player's label, 1, 2, . . . , n. 

3. If u and v are two characteristic functions then 

<f>i(u + v) = (f>i(u) + <f>i(v) . 

Theorem 21.1 (Shapley) The only function that satisfies Shapley's axioms is 
given by the Shapley values 

S:ieS 

The values arise by imagining the players join the game in random order. 
Player i receives the extra amount that he brings to the existing coalition of 
players S \ {i}, i.e., v(S) — v(S \ {i}). This must then be averaged over all the 
possible ways in which the players can arrive. 

For the oil market game we have 

(f>i{v) = \c+ \a - \b, faiv) = \b - \a , <f> 3 {v) = \c - \a - \b . 

The Shapley values give another solution concept for the game. However, note 
that this imputation is not in the core. 

21.2 Market games 

Some of the earliest examples of game theory can be found in the mathematical 
economics which was developed to understand the bargaining involved in trading. 
We will consider simple examples suggested by Edgeworth. 
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Suppose there are just two commodities (A for apples and B for bread, say). 
Assume there are M apple traders and N bread traders. A trader who has a units 
of apples and b units of bread has utility 

Ui(a,b), i- 1,2, ...,M + N. 

Assume also that the functions are concave so that every trader prefers some 
combination of the two commodities rather than either of the two extremes (a, 0) 
or (0, b) for some a, b. Hence, 

Ui (A(ai, &i) + (1 - A)(a 2 , b 2 )) > Xu { (a±, b±) + (1 - X)Ui (a 2 , b 2 ) 

for < A < 1 and i = 1, 2, . . . , M + N. 

Suppose that each trader has the same utility function u(x,y) and that each 
trader of type A or B starts with a and b units of commodities A and B respec- 
tively. Suppose that coalition S consists of s± traders of type A and s 2 traders of 
type B. The best S can ensure for itself is the highest possible sum of the utilities 
of its members that can be obtained when they trade with each other. Thus, 

Sl+S 2 

v(S) = max u(xj,yj) 

Xl ,...,X s ^-\- S 2 ,yi >---)2/s- L + S2 

1 = 1 

where 

S1+S2 S1+S2 

Xj = sia; yt = s 2 b . 

i=l i=l 

By concavity of u(-, •) we have 

Sl + S 2 / S \ 

u(xi, yi) < (si + s 2 )u ( — a, — b J = v(S) . 

r-f V s i + S 2 S1 + S2 J 

1=1 x 7 

[1, 1] -market game. The characteristic function is 

v({l}) = u(a,0); v({2}) = u(0,by, <{h 2}) = 2u ^, h - V 

and so the set of imputations is given by 

E(v) = {{u{a, 0) + pc, u(0, b) + (l- p)c) : < p < 1} 

where c = 2u(a/2, 6/2) — u(a, 0) — u(0, 6). 

We can think of p as the price of the goods, reflecting the number of units of 
B exchanged for one unit of A. 
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[1, N] -market game. Think of trader A as a monopolist. We suspect he can 
charge as high a price as he wants, provided that it is still worth the others trading. 
We illustrate this by showing that imputation 

x* = (V + l)u (^-p jj^j - Nu(0, 6), «(0, 6), . . . , u(0, &)) 

is in the core. This means showing that for any set K of k type B traders, 

Using our expression for v(S) this corresponds to showing that 

(N + l)u ( — — , J™-\ — (N — k)u(0, b)>(k + l)u ( — ^— , -^M 
v ; \N + l'N + lJ v JViJ-V J \k + l'k + lj 

which follows directly from the concavity of •). 

21.3 Competition between firms 

We now look at games of competition amongst firms. The case of two firms is 
known as duopoly; the case of more than two firms it is known as oligopoly. 
Duopoly can be regarded as a [2, oo] -market game and oligopoly can be regarded 
as a [M, oo]-market game. The first type of traders are the firms who produce a 
particular product. The second type are the buyers, or consumers, who exchange 
money for the product. 

We represent the consumer's requirements by one utility function 

u(pi,P2, ■ ■ ■ ,PM,qi,Q2, ■ ■ -,Qm) 

where pi is firm i's price for the product and qi is the amount of that firm's 
product that is bought by consumers. 

Let us assume that consumers are told the prices, pi, and then choose the 
quantities, qi, so as to maximize the above utility. Hence, this reduces to a set 
of price-demand equations which connect the demand qi for firm i's product with 
the (announced) prices pi,P2, • • • iPm, so that, say, 

Qi = fi{Pi,P2, ■ ■ -,Pm) ■ 
Firm i's utility is given by its profit 

e^(pl,P2, • • • ,Pm) = PiQi - Ci(qi) 

where Cj(-) is the production cost function for firm i. 

A similar story can be told if we suppose that firms decide on the quantities 
qi that they will produce and then the consumers' utility function determines the 
prices pi they will pay for these products. 
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21.4 Cournot equilibrium 

A Cournot equilibrium is a vector of prices (pi,P2, • • • ,Pm) such that 

Zi{Pi,P C 2i ■ ■ ■ ,Pm) = maxei(pi,P2, • • • ,Pi, ■ ■ ■ ,Pm) 

Pi 

for all firms i = 1, . . . , M. That is, p c is an equilibrium n-tuple in a n-person 
non-cooperative game of price competition. Notice that we cannot apply Nash's 
theorem since there are an infinite number of possible choices of prices, i.e., of 
pure strategies. Nevertheless, it can be shown that under reasonable assumptions 
a Cournot equilibrium always exists. 



Example. Consider a duopoly where the price-demand functions are 

Qi = /i(pijP2) = max {1 + \p 2 - |pi,0} 
Q2 = h(p\,P2) = max{l + \pi - 1^2,0} 

and suppose, for simplicity, that c\{qi) = 02(^2) = 0. 

We have that < p\ < 2 + |p 2 and < p 2 < 2 + \p\- The profit functions 
are then given by 

ei(pi,P2) = Pi + \p1P2 ~ \p\ 
e2(Pi,P2) = P2 + \P1P2 - \p\ ■ 
To find the Cournot equilibrium we must solve 

dei(p u pl)/dpi = de 2 (pl,P2)/dp 2 = 0. 

This gives equations for (pf,^) °^ 

which gives the Cournot equilibrium as pi = yy, p\ = j| , and so 

e 1 ( T f, T f) = 1.06; e2 (i6,i5) = 0.93. 

The maximization conditions dei(jp\,p2)/dpi = express the price firm 1 will 
choose given firm 2's price and vice versa. Thus, 

Pi = 9i{P2) = 1 + \P2 , P2 = 02 (pi) = 1 + 1P1 • 

Suppose firm 1 must announce its price before firm 2. Firm 2 will choose its 
price to maximize its profit given p\. Thus, it will choose p 2 = ^(Pi)- Firm 1, 
realizing this will happen, will maximize its profit by choosing pi to maximize 
ei(Pi? 02 (Pi))- Firm 1 then announces this price. 



22 Evolutionary Games 

22.1 Stackleberg leadership 

Stackleberg introduced the type of strategy introduced in Section 21.4 the firm 
who announces their price first is called the leader and the one that reacts is 
called the follower. 

Continuing our earlier numerical example, if firm 1 is the leader he should 
choose pi to maximize 

ei(pi,02(pi)) = Pi + + \pi) ~ \p\ 

which gives p\ — |. So, firm 2 will choose a price, P2 = #2(^1) — The 
corresponding profits are 

ei (|,{) = 1.067; e 2 (|,J) = 0.98. 

If firm 2 is the leader then p\ = p2 = § and the profits are 

ei(§,§) = 1.125; e 2 (|, |) = 0.9375 . 

What if they both try to be leader? They will choose prices p\ — | and p 2 = \ 
with profits 

ei(|,f) = 1.12; e 2 (f,|) = 0.975. 

Thus, a firm does better than the Cournot equilibrium when it announces its 
price, but does even better when the competing firm also announces its price. 

A similar analysis can be made when firms 1 and 2 state the quantities they 
plan to produce and then the prices adjust so that these quantities are sold. (The 
names Cournot and Bertrand are sometimes uses to distinguish between models 
of quantity competition and price competition, respectively. In fact, Cournot was 
the first to study both types of model, so we also associate his name with the 
price competition case.) 

22.2 Joint maximization of profits 

If we think of duopoly as a cooperative game then we are led to consider the joint 
maximization of profits. The two firms choose their prices (pi,p 2 ) to maximize 

e 1 {pi,p 2 ) + e 2 {pi,p 2 ) ■ 

This has the same value, when maximized, as the characteristic function value 
v({l, 2}) for the game when considered as an n-person game. Such a price vector 
is Pareto optimal but there are many other such Pareto optimal price vectors. 
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Example. Continuing the numerical example, we wish to maximize 

ei(pi,P2) + e 2 (pi,P2) =Pi + \v\Vi ~ \p\ +P2 + \p1P2 ~ \v\ ■ 
Setting derivatives to zero gives 

1 + J2P2 ~ Pi = ; 1 + j^pi -p 2 = 
which gives Pi = P2 = The profits are then 

ei (f,f) = 1.44; e 2 (f,f) = 0.96. 
The maximin values for the prices are given by finding 

max mine 1(^1,^2) and max min 62(^1,^2) • 

Pi P2 P2 Pi 

Thus, using the expressions for e\{pi,p2) and 62(^1,^2), we find that firm l's 
maximin value ensures a profit of 0.5, where p\ — 1 and P2 = 0. Firm 2's maximin 
value ensures a profit of 0.5, where pi = and P2 = 1. 

To find the negotiation set for the Nash bargaining game we must find all 
Pareto optimal prices. That is, price vectors (Pi,^) sucn that there are no other 
(pi,P2) with 

e l (pi,P2)>e l (p\,p* 2 ), i = 1, 2 

and with strict inequality in at least one component. 

We may find such price vectors by solving for all values of c the problem 

maximize ei(j?i,£>2) 

subject to 

62(^1,^2) >c, P11P2 feasible. 
We could apply Lagrangian methods to solve this problem. 

22.3 Evolutionary games 

Suppose that some individual uses a (behavioural) strategy, x say, from some set 
of possible strategies and that on meeting another individual who uses strategy y, 
there is a payoff e(x, y) to the first individual. The payoff is a measure of fitness 
describing the individual's likely reproductive success. 

No 'rational thought' is involved in selecting a strategy. Instead, the individual 
whose genes make it use strategy x will have offspring with the same genes. If 
payoffs resulting from strategy x, namely e(x, •), are high then it will produce more 
offspring using strategy x than a less successful gene type and will predominate 
in the population. 
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22.4 Evolutionary stable strategies 

Suppose that changes to a strategy arise through mutation. As usual, we look 
for equilibrium points, by which we mean strategies whose fitness is greater than 
that of any mutant strategy that could arise. If this is so then the next generation 
will have a smaller proportion of the mutant strategy and will eventually die out. 

Definition 22.1 Let X be the set of strategies. A strategy x* G X is an evolu- 
tionary stable strategy (ESS) if for every y G X , y 7^ x* then 



where x = (1 — e)x* + ey for sufficiently small e > 0. 

Consider y as the mutant strategy which affects a proportion e of the popula- 
tion. 

Notice that as e — > 0, x — > x* so for (*) to hold for small e we must have that 



If the inequality is strict then (22.1) certainly holds for small enough e. However, 



Hence x* G X is an ESS if and only if for every y G I, y 7^ x* (22.2) holds 
and if e(x*,x*) = e(y,x*) then (22.3) holds. 

We can think of an evolutionary game as a two-person non-zero-sum game in 
which the payoffs are ei(x, y) = e(x,y) and e2(x,y) = e(y,x). So, by the above, 
if x* is an ESS then (x*,x*) is an equilibrium pair for the corresponding game. 
However, not every equilibrium pair are ESSs. 

Example. Consider the two pure strategies: 'Hawk' vs 'Dove'. 'Hawk' means 
keep fighting until you or your opponent is injured and 'Dove' means run away. 
Suppose the winner earns V and injury means fitness decreases by D. Thus, the 
payoff matrix is of the following form 



e(x*,x) > e(y,x) 



(22.1) 



e(x*,x*) >e(y,x*). 



(22.2) 




(22.3) 



Hawk 



Dove 



Hawk 
Dove 
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Case V > D. For example, V = 4, D = 2. So we get payoffs 

Hawk Dove 
Hawk / 1 4 \ 

Dove V 2 ) 

But then the pure Hawk strategy x* = (1,0) is an ESS since for any y = 

(yA -y), 1 

e(x*,x*) = 1 > y = e(y,x*) . 

Case V < D. For example, put V = 2,D = 4. Now injury outweighs the 
advantage of winning and the payoffs are given by 

Hawk Dove 
Hawk / -1 2 \ 

Dove V 1 / 

For this game x* = (|, |) is an ESS. Note that y = (y, 1 — y), y ^ \ then 

e(x*,x*) = i = e(y,x*). 

So we must check that e(x*,y) > e(y,y). This follows as 

e(y,y) = l-2 ?/ 2 ; e(x*, y) = § - 2y . 
e(x*,y) - e(y,y) = (| - 2y) - (1 - 2y 2 ) = 2 (y - \f > . 

More generally for V, D > we find the following 

1. V > D. The Hawk strategy x* = (1,0) is always an ESS. 
(For V — D, (22.2) holds, so we must check (22.3)). 

2. V < D. The mixed strategy x* = (V/D, (D - V)/D) is an ESS. 

Notice that as D increases, the ESS strategy tends to the 'Dove' strategy. This 
is why snakes wrestle each other rather than bite! 
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23.1 Types of auctions 

Auctions are bidding mechanisms. They are described by a set of rules that 
specify the way bidding occurs, what information the bidders have about the 
state of bidding, how the winner is determined and how much he must pay. They 
can be viewed as partial information games in which a bidder's valuation of an 
item is hidden from the auctioneer and other bidders. The game's equilibrium is 
a function of the auction's rules. These rules can affect the revenue obtained by 
the seller, as well as how much this varies in successive instants of the auction. 
An auction is economically efficient, in terms of maximizing social welfare, if it 
allocates items to bidders who value them most. The design of an auction for a 
particular situation is an art. There is no single auctioning mechanism that is 
provably efficient and can be applied in most situations. 

Government contracts are often awarded through procurement auctions. Flow- 
ers, wines, art, U.S. treasury bonds and real estate are sold in auctions (and in- 
deed the Roman empire was auctioned by the Praetorian Guards in A.D. 193). 
They are used to sell oil drilling rights, or other natural resources, such as mobile 
telephone spectrum or satellite positions. Takeover battles for companies can be 
viewed as auctions. 

We can distinguish two important cases. In the private value model each 
bidder knows the value he places on the item, but does not know the values placed 
by other bidders. As bidding takes place, his valuation does not change, though 
he gains information from the other players' bids. In the common value model 
the item's actual value is the same for all bidders, but they have different a priori 
information about that value. Think, for example, of a jar of coins. Each player 
makes an estimate of the value of the coins in the jar, and as bidding occurs he 
can adjust his estimate based on what other players say. In this case the winner 
generally overestimates the value (since he had the highest estimate), and so he 
pays more than the jar of coins is worth. This is called the winner's curse. 

Auctions can be oral (bidders hear each other's bids and make counter-offers) 
or written (bidders submit closed sealed-bids in writing). In an oral auction the 
number of bidders may be known, but in a sealed-bid auction it may be unknown. 
Some popular types of auction are the following. 

1. English auction (or ascending price auction): bids increase in small in- 
crements until only one bidder remains. 

2. Dutch auction: the price decreases continuously until some bidder calls stop. 

3. first price sealed-bid: the winner pays his bid. 
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4. second price sealed-bid (or Vickrey auction): the winner pays the second 
highest bid. 

5. all pay sealed-bid auction: highest bidder wins, but all pay their bid. 

It is not hard to see that 1 and 4 are equivalent (with the item selling for the 
second greatest valuation), and that 2 and 3 are equivalent (with the item selling 
for the greatest bid). 

23.2 The revenue equivalence theorem 

The symmetric independent private values model (SIPV) concerns the 
auction of a single item, with risk neutral seller and bidders. Each bidder knows 
his own valuation of the item, which he keeps secret, and valuations of the bidders 
can be modelled as i.i.d. random variables. Important questions are 

• what type of auction generates the most revenue for the seller? 

• if seller or bidders are risk averse, which auction would they prefer? 

• which auctions make it harder for the bidders to collude? 

• can we compare auctions with respect to strategic simplicity? 
Let us begin with an intuitive result. 

Lemma 23.1 In any SIPV auction in which the bidders bid optimally and the 
item is awarded to the highest bidder, the order of the bids is the same as the 
order of the valuations. 

Proof. Let e(p) be the minimal expected payment that a bidder can make if he 
wants to win the item with probability p. A bidder who has valuation v and aims 
to win with probability p can make expected profit tt(v) = pv — e(p). Suppose 
the best p is p* (which depends on v) so the maximal profit is defined by 

= v-e'(p*) = 0. (23.1) 

p=p* 

Note that e(p) is convex and that at the stationary point, e'(p*)p* — e(p*) > e(0). 
Now as dir* /dp* = v — e'(p*) = 0, we have 

dir* dir* dir* dp* 

= 1 t— 

dv dv dp* dv 



tt*(v) = max [pv — e(p)] = p*v — e(p*) 
p 




(23.2) 
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If the bidder had a valuation of v +5 he could make the same bid that he makes 
with valuation v, in which he would win with the same probability and have 
the same expected payment; his expected profit would increase by dp*. Hence 
tt(v + 5v) > tt(v) + 5tt'(v), implying tt is convex in v. Hence dn* (v) / dv , and so 
also p*(v), must increase with v. Since the item goes to the highest bidder the 
optimal bid must also increase with v. ■ 

We say that two auctions have the same bidder participation if any bidder 
who finds it profitable to participate in one auction also finds it profitable to 
participate in the other. The following is a remarkable result. 

Theorem 23.1 (Revenue equivalence theorem) The expected revenue obtained 
by the seller is the same for any two SIPV auctions that (a) award the item to 
the highest bidder, and (b) have the same bidder participation. 

It is remarkable, as various auctions can have completely different sets of rules 
and strategies. Suppose there are n bidders and all participate. 



Proof. From (23.1) we have de(p(v))/dv = vdp/dv. Integrating this gives 

pV rV 

e(p(v)) — e(p(0)) + / wp' (w) dw = vp(v) — / p{w)dw 1 (23.3) 

J o Jo 

where clearly e(p(0)) = e(0) = 0, since there is no point in bidding for an item 
of value 0. Thus e(p(v)) depends only on the function p(w). We know from 
Lemma 23.1 that if bidders bid optimally then bids will be in the same order as 
the valuations. It follows that if F is the distribution function of the valuations, 
then p(w) = F(w) n ~ 1 } independently of the precise auction mechanism. The 
expected revenue is Y^i=i E Vi e-{p{vi)) = nE v e(p(v)). ■ 

Example 23.1 Assume valuations are i.i.d. with distribution function F(u). 

(a) Suppose the item is simply offered at price p and sold if any player values it 
above p. The seller computes x(p), the probability of making a sale if the price 
posted is p, and seeks to maximize px{p). Then 

x(p) = 1 - F"(p) , and P __±Z£35L_=0. 

If the distributions are uniform on [0,1], F(u) = u, the optimal price is p* = 
y/l/n + 1, and the resulting (expected) revenue is [n/(n + 1)] \/l/n + 1. For 
n = 2, p* = ^1/3, and the expected revenue is (2/3)v / T73~= 0.3849. 

(b) Suppose the item is auctioned by any of the five mechanisms above. Let n — 2. 
If all bidders bid optimally then the probabilty that a bidder with valuation v 
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wins is v, i.e. p(v) = v. From (23.3) we see that e(p(v)) = v 2 /2. So in all these 
auctions the seller's expected revenue is 2E[v 2 /2] = 1/3 = 0.3333. 

Let us compute the optimal bids in our auctions. Clearly, in the all-pay sealed- 
bid auction the optimal bid is e(p(v)) = v 2 /2. In the Dutch or first price sealed- bid 
auctions, a bidder's expected payment is p(v) times his bid. Since this must equal 
v 2 /2 we find that the optimal bid must be v/2. In the second price sealed- bid 
(Vickrey) auction, the winner pays the bid of the second highest bidder. If bidder 
1 bids u, then his profit is [v\ — V2)l{ u >v 2 }- For every possible value of t>2, this is 
maximized by bidding u — v\. 

The seller prefers (a) to (b). However, in (a) he uses information about the 
distribution of the valuations. In (b) he makes no use of such information. 

23.3 Risk aversion 

Participants in an auction can have different attitudes to risk. If a participant's 
utility function is linear then he is said to be risk-neutral. If his utility function 
is concave then he is risk-averse; now a seller's average utility is less than the 
utility of his average revenue, and this discrepancy increases with the variability 
of the revenue. Hence a risk-averse seller, depending on his degree of risk-aversion, 
might choose an auction that substantially reduces the variance of his revenue, 
even though this might reduce his average revenue. 

The revenue equivalence theorem holds under the assumption that bidders 
are risk-neutral. However, if bidders are risk-averse, then first-price sealed-bid 
auctions give different results from second-price sealed-bid auctions. In a first- 
price sealed-bid auction, a risk-averse bidder prefers to win more frequently even if 
his average net benefit is less. Hence he will make higher bids than if he were risk- 
neutral. This reduces his expected net benefit and increases the expected revenue 
of the seller. If the same bidder participates in a second-price auction, then his bids 
do not affect what he pays when he wins, and so his optimal strategy is to bid his 
true valuation. Hence, a first-price auction amongst risk-averse bidders produces 
a greater expected revenue for the seller than does a second-price auction. 

The seller may also be risk-averse. In such a case, he prefers amongst auctions 
with the same expected revenue those with a smaller variance in the sale price. 

Let us compare auctions with respect to this variance. Suppose bidders are 
risk-neutral. In the first price sealed-bid auction each bids half his valuation, so 
the revenue is (1/2) max{V"i, V^}- In the all-pay sealed-bid auction each pays half 
the square of his valuation and the revenue is ^V 2 + \V 2 , where V±, V2 ~ U[0, 1]. 
In the Vickrey auction each bids his valuation and the revenue is min{Vi, V2}. All 
these have expectation 1/3, but the variances are 1/72, 2/45 and 1/18 respectively. 
Thus a risk adverse seller preferse the first price auction to the all-pay auction, 
which is preferred to the Vickrey auction. 



24 Optimal and Multi-unit Auctions 



24.1 Optimal auctions 

Revenue- maximizing auctions are called optimal auctions and are achieved by 
imposing a reserve price or a participation fee. This reduces the number 
of participants, but leads to fiercer competition and higher bids on the average, 
which compensates for the probability that no sale takes place. 

Example 24.1 (Revenue maximization) As in Example 23.1, suppose that 
there are two potential buyers, with unknown valuations, t>i, V2, independent and 
uniformly distributed on [0, 1]. Considers two more ways of selling the object. 

(c) Suppose the item is auctioned with an English auction, but with a participation 
fee c (which must be paid if a player chooses to submit a bid). Each bidder 
must choose whether or not to participate before knowing whether the other 
participates. Clearly, there is a vq such that a player will not participate if he 
values the item at less than vq. A bidder whose valuation is exactly vq will be 
indifferent between participating or not. Hence P (winning \ v — vq)vq — c. Since 
a bidder with valuation vq wins only if the other bidder has a valuation less than 
^o, we must have P (winning | v = vq) = i>o, and hence v 2 , = c. Thus vq = a/c. 

We note that there are two ways that revenue can accrue to the seller. Either 
only one bidder participates and the sale price is zero, but the revenue is c. Or 
both bidders have valuations above ^o, in which case the revenue is 2c plus the 
sale price of minimi, i^}. The expected revenue is 

2u (l - v )c + (1 - v ) 2 [2c + v + (1 - u )/3] . 

This is maximized for c = 1/4 and takes the value 5/12 (= 0.4167). 

(d) Suppose the item is auctioned with an English auction, but with a reserve price 
p, such that the bidding starts with a minimum bid of p. There is no sale with 
probability p 2 . The revenue is p with probability 2p(l — p). If minimi, t^} > P-, 
then the sale price is min{vi, t^}- The expected revenue is 

2p 2 {l-p) + {\ + lp){l-p) 2 . 

This is maximized by p = 1/2 and the expected revenue is again 5/12, exactly 
the same as in (c). 

That cases (c) and (d) give the same expected revenue is not a coincidence. In 
both auctions a bidder participates if and only if his valuation exceeds 1/2. Let 
us consider more generally an auction in which a bidder participates only if his 
valuation exceeds some vq. Suppose that with valuation v it is optimal to bid so 
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as to win with probability p(v), and the expected payment is then e(p(v)). By a 
simple generalization of (23.3), we have 



e(p(v)) = e(p(v )) + 



w 



v 



dp(w) 
dw 



PV 

dw = vp(v) — / p(w) 

Jvq 



dw 



Assuming the SIPV model, this shows that a bidder's expected payment depends 
on the auction mechanism only through the value of Vq that it implies. The seller's 
expected revenue is 



nE v [e(p(v))] = n 



'v=v 



vp(v) 



p(w) dw 



' W = Vq 



f{v)dv 



n 



vp(v)f(v) dv — n 



p{w)f(v) dw dv 



v=v 

'OO 



W = Vq J V = W 



n 



POO 

/ {vf(v) - [1 - F{v)] }F{v) n - 1 dv 

J V = VQ 



Now differentiating with respect to vo, to find the stationary point, we see that 
the above is maximized where 

v Q f{v o )-[l-F{v o )]=0. 

We call vq the optimal reservation price. Note that it does not depend on the 
number of bidders. For example, if valuations are uniformly distributed on [0, 1], 
then vo = 1/2, consistent with the answers found for (c) and (d) above. 

If bidders' valuations are independent, but not identically distributed, we can 
proceed similarly. Let Pi(v) be the probability that bidder i wins when his valu- 
ation is v . Let ei(p) be the minimum expected amount he can pay if he wants to 
win with probability p. Suppose that bidder i does not participate if his valuation 
is less than voi- Just as above, one can show that the seller's expected revenue is 



i=l 



-I J v=v 0i 



V 



l-Fj(v) 



fi{v)pi(v) dv 



(24.1) 



The term in square brackets can be interpreted as 'marginal revenue', in the 
sense that if a price p is offered to bidder i, he will accept it with probability 
x i{p) — 1 — Fi(p), and so the expected revenue obtained by this offer is pxi(p). 
Therefore, differentiating pxi(p) with respect to Xi, we define 



MRiip) 



-(rW) = S («W)/^ 



dx 



P 



The right hand side of (24.1) is simply E[MRi* (i>i*)], where i* is the winner of 
the auction. This can be maximized simply by ensuring that the object is always 
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awarded to the bidder with the greatest marginal revenue, provided that marginal 
revenue is positive. We can do this, provided bidders reveal their true valuations. 
Let us assume that MRi(p) is increasing in p for all i. Clearly, vqi should be the 
least v such that MRi(v) is nonnegative. Consider the auction rule that always 
awards the item to the bidder with the greatest marginal revenue, and then asks 
him to pay the maximum of Voi and the smallest v for which he would still remain 
the bidder with greatest marginal revenue voi • This has the character of a second- 
price auction in which the bidder's bid does not affect his payment, given that he 
wins. So bidders will bid their true valuations and (24.1) will be maximized. 

Example 24.2 An interesting property of optimal auctions with heterogeneous 
bidders is that the winner is not always the highest bidder. 

Consider first the case of homogeneous bidders with valuations uniformly dis- 
tributed on [0, 1]. In this case MRi(vi) — vi — (1 — Vi)/1 — 2vi — 1. The object 
is sold to the highest bidder, but only if 2vi — 1 > 0, i.e., if his valuation exceeds 
1/2. The winner pays either 1/2 or the second greatest bid, whichever is greatest. 
In the case of two bidders with the identical uniformly distibuted valuations the 
seller's expected revenue is 5/12. This agrees with what we have found previously. 

Now consider the case of two heterogeneous bidders, say A and 5, whose val- 
uations are uniformly distributed on [0, 1] and [0,2] respectively. So MRa{va) = 
2va — 1, vqa = 1/2, and MRb{vb) = 2vb — 2, vqb = 1- Under the bidding rules 
described above, bidder B wins only if 2vb — 2 > 2va — 1 and 2vb — 2 > 0, i.e., 
if and only if vb — va > 1/2 and vb > 1; so the lower bidder can sometimes win. 
For example, if va = 0.8 and vb = 1-2, then A wins and pays 0.7 (which is the 
smallest v such that MR A {v) = 2v - 1 > 2v B - 2 = 0.4). 

24.2 Multi-unit and combinatorial auctions 

Multi-unit auctions are of great practical importance, and have been applied 
to selling units of bandwidth in computer networks and satellite links, MWs of 
electric power, capacity of natural gas and oil pipelines. These auctions can 
be homogeneous or heterogeneous. In a homogeneous auction a number of 
identical units of a good are to be auctioned, and we speak of a multi-unit 
auction. In the simplest multi-unit auction, each buyer wants only one unit. The 
auction mechanisms above can be generalized. For example, in a simultaneous 
auction of k units, all bidders could make closed sealed-bids, and the k objects 
could be awarded to the k highest bidders. In a first-price auction each bidder 
would pay his own bid. In a generalization of the Vickrey auction the k highest 
bidders would pay the value of the highest losing bid. It can be shown that the 
revenue-equivalence theorem still holds for these auctions. Note that in the first- 
price auction the successful bidders pay differently for the same thing; we call 
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this is a discriminatory auction. By contrast, the Vickrey auction is called 
a uniform auction, because all successful bidders pay the same. A uniform 
auction is intuitively fairer, and also more likely to reduce the winner's curse. 

Things are more complex if bidders want to buy more than one object, or 
if objects are different, and perhaps complementary. For example, the value 
of holding two cable television licenses in contiguous geographic regions can be 
greater than the sum of their values if held alone. This means that it can be 
advantageous to allow combinatorial bidding. Here, bidders may place bids on 
groups of objects as well as on individual objects. A generalization of the Vickrey 
auction that can be used with combinatorial bidding is the Vickrey- Clarke- 
Groves (VCG) mechanism. Each bidder submits bids for any combinations of 
objects he wishes. The auctioneer allocates the objects to maximize the aggregate 
total of their values to the bidders. Each bidder who wins a subset of the objects 
pays the 'opportunity cost' that this imposes on the rest of the bidders. 

More specifically, let L be the set of objects and P be the set of their possible 
assignments amongst the bidders. Each bidder submits a bid that specifies a value 
Vi(T) for each non-empty subset T of L. An assignment S G P is a partition of 
L into subsets Si, with one such subset per bidder i (possibly empty). If social 
welfare maximization is the objective, then the auctioneer chooses the partition 
S = {'S'i , . . . , S n } that maximizes Y^i=\ v i(^i)- Bidder i pays pi, where 

pi = ^E^-(^) - E^-(^) • ( 24 - 2 ) 

The first term on the right of (24.2) is the greatest value that could be obtained 
by the other bidders if i were not bidding. The final term is the value that is 
obtained by the other bidders when bidder i does participate, and so influences 
the optimal allocation and takes some value for himself. 

This type of auction is incentive compatible, in the sense that bidders 
are incentivized to submit their true valuations, and it leads to an economically 
efficient allocation of the objects. It has the advantage that the whole market is 
available to bidders and they can freely express their preferences for substitutable 
or complementary goods. However, there are drawbacks. Firstly, the complex 
mechanism of a VCG auction can be hard for bidders to understand. It is not 
intuitive and bidders may well not follow the proper strategy. Secondly, it is 
very hard to implement. This is because each bidder must submit an extremely 
large number of bids, and the auctioneer must solve a ./VP-complete optimization 
problem to determine the optimal partition (and also each of the pi), so the 
'winner determination' problem can be unrealistically difficult to solve. There 
are several ways that bidding can be restricted so that the optimal partitioning 
problem becomes solvable in polynomial time. Unfortunately, these restrictions 
are rather strong, and are not applicable in many cases of practical interest. 
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